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Editorial 

Eureka 62 


W hen the Archimedeans asked me to edit Eureka for the third 
time, I was a bit sceptical. Issue 60 was the hrst to get a pa- 
perback binding and issue 61 was the hrst to be published in 
full colour and with a new design. How could we make this issue special 
- not just a repeat of the previous one? 

Eureka has always been a magazine for students, not a research journal. 
Articles should be interesting and entertaining to read, and often they 
are a stepping stone into particular problems or areas of mathematics 
which the reader would not usually have encountered. 

Every year we receive many great articles by students and mathemati- 
cians. Our task as editors is often to make them more visually appealing 
- and we can do so using images, diagrams, fonts or colours. 

What we wanted to add in this issue was interactivity, such as videos, 
slideshows, animations or games. Unfortunately this still is quite dif- 
hcult on paper, so we decided to publish a second version of Eureka as 
interactive eBook for mobile devices like iPad. And we hope that this 
will make reading mathematics even more engaging and fun. 


Editors 

Philipp Legner (StJohn's) 
Jack Williams (Clare) 

Assistant Editors 

Stacey Law (Trinity) 

Carina Negreanu (Oueens') 
Katarzyna Kowal (Churchill) 
Douglas Bourbert (Churchill) 
Ram Sarujan (Corpus Christi) 

Subscriptions 

WesleyMok (Trinity) 


The digital version will, for the hrst time, make Eureka available to a 
large number of students outside Cambridge. And therefore we have 
reprinted some of the best articles from previous issues. We spent many 
hours in the library archives, reading old copies of Eureka, though of 
course there are many more great articles we could have included. 

The articles in this issue are on a wide range of topics - from num- 
ber theory to cosmology, from statistics to geometry. Some are very 
technical while others are more recreational, but we hope that there is 
something interesting for everyone. 

I want to thank the editorial team for all their work, and the authors for 
their excellent articles. We hope you will enjoy reading Eureka 62! 


Philipp Legner and Jack Williams 
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The Archimedeans 

YuhanGao, President 2012 — 2013 


T his year was yet another highly successful 
one for The Archimedeans. The society 
welcomed over 150 new members, courtesy 
of a very popular Freshers’ Squash. We hosted a 
number of talks given by speakers from the uni- 
versity over the course of Michaelmas and Lent. 
These covered a number of different topics, cater- 
ing for those with interests in pure, applied and 
applicable mathematics. Highlights included 
talks by Prof. Grae Worster on Ice, and Prof. Imre 
Leader on Games of Pursuit and Evasion. 

The society expanded the range of events which 
we offered to our members this year. We held a 
board games evening, which proved to be a thor- 
oughly enjoyable night for all those who attended. 
One of our most anticipated events was the black- 
tie Annual Dinner in the delightful surroundings 
of the Crowne Plaza Hotel. 

A tradition of the Archimedeans is to hold an an- 
nual Problems Drive. This time around, teams 

The Committee 2012 - 2013 

President 

Yuhan Gao (Trinity) 

Vice-Presidents 

Sean Moss (Trinity) 

DanaMa (Newnham) 

CORPORATE OFFICER 

Joseph Briggs (Trinity) 

Secretary 
Jacquie Hu (Jesus) 


from as far aheld as Oxford came to take part in an 
engaging and entertaining mathematics competi- 
tion. Prizes were awarded not only for the teams 
with the highest scores, but also for particularly 
creative team names. The questions given can be 
found in this journal, and we welcome you to try 
them yourself. 

The year hnished on a high in May Week, courtesy 
of the Science and Engineering Garden Party. Six 
societies from the university joined together to 
host a brilliant afternoon of fun, aided by a jazz 
band. Finger food and Pimms was served, and 
there was even a cheese bar on offer. 

We would like to thank our members for contrib- 
uting to an excellent year for the society. I would 
also like to thank the committee for all of their 
hard work, and Philipp Kleppmann, last years 
President, along with the previous committee, for 
everything which they have done for the society. 
We look forward to another exciting year ahead. 


Treasurer 

Colin Egan (Gomille and Caius) 

Events Managers 

Pawel Rzemieniecki (Fitzwilliam) 

YumingMei (Emmanuel) 

PUBLICITY OFFICER 

James Bell (Gomille and Caius) 

Webmaster 

Ben Millwood (Downing) 
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I love computer languages. In fact, I’ve spent 
roughly half my life nurturing one particular 
very rich computer language: Mathematica. 

But do we really need computer languages to tell 
our computers what to do? Why cant we just use 
natural human languages, like English, instead? 

If you had asked me a few years ago, I would 
have said it was hopeless. That perhaps one could 
make toy examples, but that ultimately natural 
language just wouldn’t be up to the task of creat- 
ing useful programs. 

But then along came Wolfram|Alpha in which 
weve been able to make free-form linguistics 
work vastly better than I ever thought possible. 

But still, in Wolfram|Alpha the input is essen- 
tially just set up to request knowledge - and 


Wolfram|Alpha responds by computing and 
presenting whatever knowledge is requested. But 
programming is different. It is not about gen- 
erating static knowledge, but about generating 
programs that can take a range of inputs, and 
dynamically perform operations. 

The hrst question is: how might we represent 
these programs? In principle we could use pretty 
much any programming language. But to make 
things practical, particularly at the beginning, 
we need a programming language with a couple 
of key characteristics. 

The most important is that programs a user might 
specify with short pieces of natural language 
must typically be short - and readable - in the 
computer language. Because otherwise the user 
wontbe able to tell - atleast not easily- whether 
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Smallest pertect number. Hexagonal tilings give 
the densest 'sphere' packing in two dimensions. 



the program thats been produced actually does 
what they want. 

A second, somewhat related criterion is that it 
must be possible for arbitrary program frag- 
ments to stand alone - so that large programs 
can realistically be built up incrementally, much 
like a description in natural language is built up 
incrementally with sentences and the like. 

To get the hrst of these characteristics requires 
a very high-level language, in which there are 
already many constructs already built in to the 
language - and well enough designed that they 
all fit together without messy “glue” code. 

And to get the second characteristic essentially 
requires a symbolic language, in which any piece 
of any program is always a meaningful symbolic 
expression. 

Conveniently enough, there is one language that 
satisfies rather well both these requirements: 
Mathematica! 


The linguistic capabilities of Wolfram|Alpha give 
one the idea that one might be able to under- 
stand free-form natural language specifications 
of programs. Mathematica is what gives one the 
idea that there might be a reasonable target for 
programs generated automatically from natural 
language. 

For me, there was also a third motivating idea - 
that came from my work on A New Kind of Sci- 
ence. One might have thought that to perform 
any kind of complex task would always require 
a complex program. But what I learned in A New 
Kind ofScience is that simple programs can often 
do highly complex things. 

And the result of this is that its often possible to 
find useful programs just by searching for them 
in the computational universe of possible pro- 
grams - a technique that we use with increas- 
ing frequency in the actual development of both 
Wolfram|Alpha and Mathematica. 
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Number of hills in Rome and Wonders of the 
Ancient World. Secret agent number of James Bond. 
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And it was this that made me think that - even if 
all else failed - one might be able to “synthesize” 
programs from natural language just by search- 
ing for them. 

OK - so there are reasons to hope that it might be 
possible to use natural language input to do pro- 
gramming. But can one actually make it work? 

Even when Wolfram|Alpha was launched, I 
still wasnt sure. But as we worked on bringing 
Wolfram| Alpha together with Mathematica, I got 
more and more optimistic. 

And with Mathematica 8 we have launched the 
hrst production example. It is certainly not the 
end of the story, but I think its a really good 
beginning. And I know that even as an expert 
Mathematica programmer, I’ve started routinely 
using natural language input for certain steps in 
writing programs. 

One can also specify programs in natural lan- 
guage to apply to things one has constructed in 
Mathematica. And in a Mathematica session, one 
can discard the natural language and just use the 
generated code by clicking that code. Some inter- 
esting examples are shown above. 

Now, of course, there are many issues - for exam- 
ple about disambiguation. But the good news is 
that we’ve got schemes for addressing these that 
we’ve been able to test out well in Wolfram| Alpha. 


I have to say that something I thought would be a 
big issue is the vagueness of natural language. That 
one particular natural language input might equal- 
ly well refer to many different precise programs. 

And I had imagined it would be a routine thing 
to have to generate test examples for the user in 
order to be able to choose between different pos- 
sible programs. 

But in reality this seems to be quite rare: there is 
usually an “obvious” interpretation, that in typi- 
cal Wolfram|Alpha style, one can put hrst, with 
the less obvious interpretations a click away. 

So how well does this all work? We have built out 
some particular areas of program functionality, 
and we will progressively be building out many 
more as time goes on. 

They are primarily set up to work in Mathemat- 
ica. But actually you can see most of them in 
some form just on the Wolfram|Alpha website 
- though obviously no references to variables or 
other parts of a Mathematica session can be used. 

How robust is it all? Its dehnitely usable, but I 
would certainly like it to be more robust - and we 
will be working hard in that direction. 

One issue that we have faced is a lack of linguistic 
corpora in the area. We’ve scoured a couple of 
decades of our own tech support logs, as well as 
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many programming forums, to try to find natu- 
ral language descriptions matched with precise 
programs. But we haven’t be able to apply any- 
thing like the same level of automatic filtering 
to this process as we’ve been able to apply in 
many other areas of “linguistic discovery” for 
Wolfram|Alpha. 

There are zillions of fascinating research projects 
to do in figuring out generalized grammars for 
specifying different kinds of programming con- 
structs in natural language - and I will look for- 
ward to seeing this field of inquiry develop. 

We now have another important source of data: 
actual examples of natural language program- 
ming being done in Mathematica. And look- 
ing at our real-time monitoring system for the 
Wolfram|Alpha server infrastructure, I can see 
that very soon we are going to have a lot of data 
to study. 

How far will it be possible to get with natural 
language programming? Even six months ago 
I thought it was only going to be possible to do 
fairly simple examples. But seeing what we have 
actually been able to build, I am extremely opti- 
mistic about what will be possible. 

The hope would be that in the end one will just 
have to describe in natural language the goal for 
one s program - and then an actual program that 
achieves that goal will be synthesized. Some- 
times this will directly be possible from under- 
standing the specihcation of the goal. Sometimes 
to create the necessary program will require a 
whole program-creation process - probably of- 
ten involving searching for an appropriate pro- 
gram in a space of possible programs, in the style 
of A New Kind ofScience. 

It will be important to do program simplification 
- again often achieved by program search - in or- 
der to be able to get the simplest and most read- 
able (and perhaps the most efficient) program 
that meets the requirements that have been given. 

At this point, I am still concerned about how 
much of this will be possible in “interactive times” 
of a few seconds. But if history is a guide, with 
good algorithms and heuristics, and a healthy 
dose of large-scale parallelism, itTl gradually be 
possible to get the times down. 

So what will be the result? I expect natural lan- 
guage programming will eventually become 


This orticle is reprinted from his blog at 
blog.stephenwolfram.com with kind 
permission ofStephen Wolfram. 


ubiquitous as a way of telling computers what to 
do. People will be able to get started in doing pro- 
gramming-like tasks without learning anything 
about official “programming” and programming 
languages: theyTl just converse with their com- 
puters as they might converse with another per- 
son. 

What will happen to programming languages? 
Actually, I think theyTl become much more vis- 
ible and widely known than ever before. Because 
in natural language programming interfaces one 
will probably be shown the programming lan- 
guage code thats being synthesized. 

People will see that, and gradually learn cases 
where its much faster and more precise just 
to enter code like that directly, without going 
through natural language. 

By the way: in Mathematica we are beginning to 
have code generation capabilities for low-level 
languages like C. So it s going to be technically 
possible to go all the way from natural language 
input down to something like C. And for some 
practical purposes - especially with embedded 
systems - that will no doubt be quite useful. 

But when it comes to doing traditional program- 
ming alongside natural language programming, 
there s going to be a great premium on having a 
succinct readable programming language - like 
Mathematica. 

With the free-form linguistics of Mathematica 
we are at the first step in a long journey. But it is a 
journey ITn now confident we can take. After so 
many years, the science-fiction concept of being 
able to tell a computer what to do by using plain 
human language is gradually going to become 
reality - in a way that fascinatingly coexists with 
whats been achieved in high-level computer lan- 
guages. 


Number of muses in Greek mythology. 
Exponential tactorial, since 9 = 3 2 \ 
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Base of our number system. Sum of the first three 
primes, first four integers and first four factorials. 
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Pentaplexity 

Sir Roger Penrose 


First published in issue 39,1978 

C ertain shapes, when matched correctly, 
can form a tiling of the entire plane but in 
a non-periodic way. These tilings have a 
number of remarkable properties, and I shall give 
here a brief account explaining how these tiles 
came about and indicating some of their proper- 
ties. 

The starting point was the observation that a regu- 
lar pentagon can be subdivided into six smaller 
ones, leaving only five slim triangular gaps. This is 
familiar as part of the usual “net” which folds into 
a regular dodecahedron, as shown in Figure 1. 
Imagine now, that this process is repeated a large 
number of times, where at each stage the penta- 
gons of the figure are subdivided according to the 
scheme of Figure 1. There will be gaps appearing 


of varying shapes and we wish to see how best to 
fill these. At the second stage of subdivision, dia- 
mond-shaped gaps appear between the pentagons 
(Figure 2). At the third, these diamonds grow 
“spikes”, but it is possible to find room, within 
each such “spiky diamond”, for another pentagon, 
so that the gap separates into a star (pentagram) 
and a “paper boat” (or Jesters cap?) as shown in 
Figure 3. At the next stage, the star and the boat 
also grow spikes, and, likewise, we can find room 
for new pentagons within them, the remaining 
gaps being new stars and boats (as before). These 
subdivisions are shown in Figure 4. 

Since no new shapes are now introduced at sub- 
sequent stages, we can envisage this subdivision 
process proceeding indefinitely. At each stage, the 
scale of the shapes can be expanded outwards 
so that the new pentagons that arise become the 



▲ Figure 1 


▲ Figure2 



▲ Figure 3 


First Apollo mission to land of the moon. 
Number of space-time dimensions in M-theory. 
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▲ Figure 6 


same size as those at the previous stage. As things 
stand, however, this procedure allows ambiguity 
that we would like to remove. The subdivisions of 
a “spiky diamond” can be achieved in two ways, 
since there are two alternate positions for the pen- 
tagon. Let us insist on just one of these, the rule 
being that given in Figure 5. (When we examine 
the pattern of surrounding pentagons we neces- 
sarily hnd that they are arranged in the type of 
conhguration shown in Figure 5.) It may be men- 
tioned that had the opposite rule been adapted for 
subdividing a “spiky diamond”, then a contradic- 
tion would appear at the next stage of subdivision, 
but this never happens with the rule of Figure 5. 

This procedure, when continued to the limit, leads 
to a tiling of entire plane with pentagons, dia- 
monds, boats and stars. But there are many “in- 
correct” tilings with the same shapes, being not 
constructed according to the above prescription. 
In fact, “correctness” can be forced by adopting 
suitable matching rules. The clearest way to de- 
pict these rules is to modify the shapes to make a 
kind of inhnite jigsaw puzzle, where a suggested 
such modihcation is given in Figure 6. It is not 
hard to show that any tiling with these six shapes 
is forced to have a hierarchical structure of the 
type just described. 

Properties of these Tilings 

Furthermore, the forced hierarchical nature of 
this pattern has a number of very remarkable 
properties. In the hrst place, it is necessarily non- 
periodic (i.e. without any period parallelogram). 
More about this later. Secondly, though the com- 
pleted pattern is not uniquely determined - for 
there are 2 K ° different arrangements - these dif- 
ferent arrangements are, in a certain “hnite” sense, 
all indistinguishable from one another! Thus, no 
matter how large a hnite portion is selected in one 
such pattern, this hnite portion will appear some- 
where in every other completed pattern (inhnitely 
many times, in fact). Thirdly, there are many un- 
expected and aesthetically pleasing features that 
these patterns exhibit (see Figure 7). For exam- 
ple, there are many regular decagons appearing, 
which tend to overlap in places. Each decagon is 
surrounded by a ring of twelve pentagons, and 
there are larger rings of various kinds also. Note 
that every straight line segment of the pattern ex- 
tends outwards to inhnity, to contain an inhnite 
number of line segments of the hgure. The hier- 
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▲ Figure 7 


▲ Figure8 


archical arrangement of Figure 7 is brought out 
in Figure 8. 

After I had found this set of six tiles that forces 
non-periodicity, it was pointed out to me (by Si- 
mon Kochen) that Raphael Robinson had, a num- 
ber of years earlier, also found a (quite different) 
set of six tiles that forces non-periodicity. But it 
occurred to me that with my tiles one can do bet- 
ter. If, for example, the third “pentagon” shape is 
eliminated by being joined at two places to the 
“diamond” and at one place to the bottom of the 
“boat”, then a set offive tiles is obtained that forces 
non-periodicity. It was not hard to reduce this 
number still further to four. And then, with a lit- 
tle slicing and rejoining, to two\ 

The two tiles so obtained are called “kites” and 
“darts”, names suggested by John Conway. The pre- 
cise shapes are illustrated in Figure 9. The match- 
ing rules are also shown, where vertices of the 
same colour must be placed against one another. 
There are many alternative ways to colour these 
tiles to force the correct arrangement. One way 
brings out the relation to the pentagon-diamond- 
boat-star tilings shown in Figure 10. A patch of 
assembled tiles (partly coloured in this way) is 
shown in Figure 11. The hierarchical nature of the 
kite-dart tilings can be seen directly, and is illus- 


trated in Figure 12. Take any such tiling and bisect 
each dart symmetrically with a straight line seg- 
ment. The resulting half-darts and kites can then 
be collected together to make darts and kites on 
a slightly larger scale: two half-darts and one kite 
make a large dart; two half-darts and two kites 
make a large kite. It is not hard to convince one- 
self that every correctly matched kite-dart tiling 
is assembled in this way. This “inhation” property 
also serves to prove non-periodicity. For suppose 
there were a period parallelogram. The corre- 
sponding inhated kites and darts would also have 
to have the same period parallelogram. Repeat 
the inflation process many times, until the size 
of the resulting inhated kites and darts is greater 
than that of the supposed period parallelogram. 
This gives a contradiction. 

The contradiction with periodicity shows up in 
another striking way. Consider a very large area 
containing d darts and k kites, which is obtained 
referring to the inflation process a large number 
of times. The larger the area, the closer the ratio 
x = k/d of kites to darts will be to satisfying the 
recurrence relation x = (1 + 2x)/(l + x), since, 
on inflation, a dart and two kites make a larger 
kite, while a dart and a kite make larger dart. This 
gives, in the limit of an inhnitely large pattern, 


A torus can be sliced into 13 pieces with just 
three plane cuts. Number of Archimedean solids. 
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▲ Figure 9 



▲ Figure 10 



▲ Figure 11 


* = £ (l + \/5) = (p , the golden ratio! Thus we 
get an irrational relative density of kites to darts 
- which is impossible for a periodic tiling. (This 
is the numerical density. The kite has (p times the 
area of the dart, so the total area covered by kites 
is (p 2 (= 1 + (p ) times that covered by darts.) 

Jigsaws and beyond 

There is another pair of quadrilaterals which, 
with suitable matching rules, tiles the plane only 
non-periodically: a pair of rhombuses as shown 
in Figure 13. A suitable shading is suggested in 
Figure 14, where similarly shaded edges are to be 
matched against each other. In Figure 15, the hier- 
archical relation to the kites and darts is illustrat- 
ed. The rhombuses appear mid-way between one 
kite-dart level and the next inhated kite-dart level. 

Many ditTerent jigsaw puzzle versions of the 
kite-dart pair or the rhombus pair can evidently 
be given. One suggestion for modihed kites and 
darts, in the shape of two birds, is illustrated in 
Figure 16. 

Other modihcations are also possible, such as 
alternative matching rules, suggested by Robert 
Ammann (see Figure 17) which force half the tiles 
to be inverted. 


Many intriguing features of these tilings have 
not been mentioned here, such as the pentago- 
nally-symmetric rings that the stripes of Figure 
14 produce, Conway s classihcation of “holes” in 
kite-dart patterns (i.e. regions surrounded by 
“legal” tilings but which cannot themselves be 
legally hlled), Ammanns three-dimensional ver- 
sion of the rhombuses (four solids that appar- 
ently fill space only non-periodically), Ammanns 
and Conway s analysis of “empires” (the infinite 
system of partly disconnected tiles whose posi- 
tions are forced by a given set of tiles). It is not 
known whether there is a single shape that can tile 
the Euclidean plane non-periodically. For the hy- 
perbolic (Lobachevski) plane a single shape can 
be provided which, in a certain sense, tiles only 
non-periodically (see Figure 18) - but in another 
sense a periodicity (in one direction only) can oc- 
cur. (This remark is partly based on suggestions of 
John Moussouris.) 


References 

1. M. Gardner, Scientific American, January 
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2. R. Penrose, Bull. Inst. Maths. & its Applns. 
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Open meandric number. These are the number of 
meanders in non-self-intersecting oriented curves. 








▲ Figure 13 






▲ Figure 15 




▲ Figure 16 


▲ Figure 17 



▲ Figure 18 


Magic constant in a third order magic square. 
Triangular, hexagonal, pentatope and Bell number. 
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Squared Squares 

Philipp Kleppmann, Corpus Christi 


F igure 1 shows a rectangle that is dissected into 
smaller squares, all of which have different 
sidelengths. Such rectangles are called squared 
rectangles. Of course, rectangles like this one can be 
constructed by trial and error if you have enough time 
or a computer. The task becomes harder if you try to 
produce a squared square. The challenge of hnding 
one arose in the early twentieth century from a prob- 
lem in a mathematical puzzle book called The Can- 
terbury Puzzles [5]. It wasnt even clear that a squared 
square existed, until R. Sprague found one in 1939 [3], 
more than 30 years later. 

In the 1930s, the four Cambridge undergraduates Ro- 
land Brooks, Cedric Smith, Arthur Stone, and William 
Tutte came across this problem and devised some very 
clever methods of producing squared rectangles and 
squares using the theory of electrical networks, some 
of which I will present here. The present-day logo of 
the Trinity Mathematical Society is a squared square, 
in recognition of the four Trinity students. 


The low-tech method 

Draw a rectangle cut up into smaller rectangles, as in 
Figure 2. Squint at it and imagine that it is just a bad 
drawing of a squared rectangle. Assign values x and y 
to the sidelengths of two of the ' squares’ as shown in 
the hgure. From these it is easy to determine all other 
sidelengths: First x + y above the two starting squares, 
then 2x + y to the left, and so on. We have to make sure 
that the two vertical sides of the big rectangle have the 
same length. For this we need (5x + 3y) + (8x + 4 y) = 
(4x + 4 y) + (4x + 5y), i.e. 5x = 2y. So, taking x = 2 and 
y = 5, we get the squared rectangle in Figure 1. 
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Number of pawns in a chess set, and each player 
starts with 16 pieces. Length of bank card numbers. 























This method was used by Arthur Stone to con- 
struct his first squared rectangle [4]. While it is 
easy to apply, it is also luck-dependent. You cant 
count on finding a dissection that can become a 
squaring, and sometimes the equations give nega- 
tive values for some sidelengths. This makes a sys- 
tematic analysis very difficult. For example, if one 
is interested in the smallest number of squares 
that a rectangle can be cut up into, it is not at all 
clear how to show that there are no smaller ones. 
In fact, the smallest number is 9. This was proved 
in [1] using the following more refined method. 

The high-tech method 

Suppose we have a squared rectangle, such as the 
one in Figure 1. We construct a directed graph 
with a vertex for each horizontal line segment and 
an edge for each square. There is an arrow from 
a vertex v to a vertex w if and only if the corre- 
sponding horizontal line segments V and W in 
the rectangle are connected by a square and V is 
above W. We label the arrow with the sidelength 
of this square. Figure 3 illustrates the procedure. 

The graph in Figure 4 is constructed in this way 
from the rectangle in Figure 1. It is called the 
Smith diagram of the rectangle. 

Now call P and Q the poles of the network, and 
interpret the labels of the edges as currents. There 
are a couple of things in the graph that you may 
notice: 

1. For any vertex that isnt a pole, the sum of 
currents entering it is equal to the sum of 
currents Aowing out of it: The sum of the 
sidelengths of squares lying directly above 
one horizontal line segment in the squared 
rectangle is the same as the sum of the 
sidelengths of squares lying directly below it. 

2. The sum of currents around any circuit 
is zero (counting currents in the ‘wrong’ 
direction as negative currents). This is 
because the length of any straight vertical 
path from one horizontal line segment to 
another one is the same, no matter which 
squares it passes through. 

3. The sum of the currents leaving P is equal 
to the sum of the currents entering Q, since 
the lengths of the two horizontal sides of 
the rectangle are equal. 


U 




And - heypresto! - weve built an electric network 
in which the given currents are valid as long as we 
assume that each wire has unit resistance. (1) and 
(2) are called Kirchhoffs laws. 

In fact, this construction works in the other direc- 
tion as well: If we construct an electric network 
satisfying the three conditions above and which 
has different currents along all of its wires, then 
it is a blueprint for a squared rectangle! Of course, 
the network encapsulates the same information as 
the squared rectangle, but it has many advantages 
over the first method. Graphs are well-established 
mathematical objects, so we can fall back on a 
large body of theory. In particular, the theory of 
electric networks can be used for further investi- 
gations. See [1] for more. 

Graphs can be searched systematically. For exam- 
ple, to show that there are no squared rectangles 
consisting of fewer than 9 squares we can search 
all directed graphs with at most 8 (and at least 2) 
edges and try to assign distinct values to the edges 
in such a way that the three conditions are sat- 
isfied. But there arent any [1]! The rectangle in 
Figure 1 consists of 9 squares, so it is proved that 
the smallest number of squares in a squared rec- 
tangle is 9. 


The "least random number". 
Number of wallpaper groups. 
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A squared square 

One way of finding a squared square is to exhaus- 
tively search through all squared rectangles, until 
you spot one. The smallest has 21 pieces [3], so 
this might take a while. It turns out that looking 
for a squared rectangle that can be cut up into 
squares in two completely different ways (mean- 
ing that none of the squares of one dissection ap- 
pear in the other dissection) will get you there a 
lot faster. 

The smallest such rectangle is 422 x 593 and can 
be cut up into 13 pieces in two different ways [4]. 
The sidelengths of the component squares are 18, 
38, 49, 67, 72, 85, 103,116, 154,175, 192, 230, 247 
and 2, 22, 37, 39, 41, 43, 80, 164, 178, 200, 207, 
215, 222, respectively. The two rectangles are 
combined with two squares to form one large 
squared square, as shown in below. You might 
like to assign the sidelengths to the component 
squares yourseh! 
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Sum of the first three pentagonal numbers, 
and thus a "pentagonal pyramidal number". 
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Tricky Teacups 

Vito Videtta,Trinity Hall 


H ere is a puzzle I found when yolunteering 
at maths outreach events in Cambridge: 
it is called Auntys Teacups. We are given 
16 teacups, four each in each of four colours, and, 
similarly, 16 saucers. Arrange the cups on top of 
the saucers in a 4 by 4 square grid so that: 

1. in each row and column, there is one cup of 
each colour; 

2. in each row and column, there is one saucer 
of each colour; 

3. (Orthogonality Condition) no cup-saucer 
colour combination is repeated. (To be clear, 
this means that, for example, red-on-green 
and green-on-red are both allowed.) 

I was rather intrigued when I hrst saw this puzzle; 
it appeared to be so simple, yet everybody who 
tried it quickly found out it was a Pandoras Box. 
It was apparent that I was hooked as soon as I got 
home; I immediately started working on a solu- 
tion. Before we get to that though (and to give you 
a chance to try it for yourself), lets go through 
some of the history of this problem. 

Square arrangements of the above type were hrst 
studied by Leonhard Euler. In a seminal paper 
published in 1782, he poses the following prob- 
lem: £ Given a group of 36 othcers of six different 
ranks, one each from six different regiments, is it 
possible to arrange the othcers in a square, in such 
a manner that in each line, vertical or horizontal, 
there is one othcer of each rank and one from each 
regiment?’ This problem came to be known as “Le 
Problĕme des 36 Othciers”; despite its apparently 
simple formulation, it turned out to be one of the 


hardest mathematical problems ever posed. Con- 
sequently, this led to a barrage of new mathemat- 
ics being created as more and more mathemati- 
cians tried to rise up to Eulers Challenge. 

As with much great mathematics, this particular 
problem managed to find its way into popular cul- 
ture in the form of many puzzles. One version that 
is particularly simple to set up requires a pack of 
cards. Take the Ace, King, Queen and Jack of each 
suit. Arrange the cards in a 4 by 4 square grid so 
that in each row and column, there is one card 
of each rank and one of each suit. You can easily 
see that this puzzle is equivalent to the Teacups 
problem (in fact, it is slightly easier, since we don t 
have to worry about the orthogonality condition). 

Journey towards a solution 

A Latin square of order n is an nxn matrix con- 
taining n copies of the numbers 1 to n arranged 
so that in each row and column, each number 
appears once and only once. Latin squares arise 
naturally as the multiplication tables of finite 
groups (of course, the symbols we use to label the 
square are unimportant). Suppose A and B are 
Latin squares of order n\ we say that A and B are 
orthogonal if (A^, By) ranges through all possible 
ordered pairs as i,j range through all legal indices. 
Moreover, we call B an orthogonal mate to A. In a sense, 
orthogonal Latin squares (OLSs) are Latin squares 
that are as different as possible from each other. 

We now see how to translate the Aunty s Teacups 
problem into mathematics; we are merely search- 
ing for a pair of OLSs of order 4. 
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The number of quarter or half turns required to 
optimally solve a Rubik's cube in the worst case. 



I began my investigation by, rather shamefully, 
writing a computer program to hnd all possible 
solutions. I started by labelling the four colours 
using the numbers 1 to 4 and using the notation 
(c, s) to describe the entries in the 4-by-4 matrix, 
where c is the cup colour and s is the saucer colour. 
After five hours of coding and a split second of 
computing, it spat out a completely unintelligible 
sequence of the numbers 1 to 4 and, somewhat 
more importantly, the number of solutions it had 
found: 6912. That was nice, but uninformative. 

And so I left the problem there for a while, in 
which time I hnished my hrst two years at Uni- 
versity. But when I showed the puzzle to a young 
girl and her father at another outreach event, an 
obvious fact hit me with more force than a meteor 
strike: given any solution, I could cyclically per- 
mute the rows and columns to generate new solu- 
tions. Topologically then, the grid can be wrapped 
around into a tube and the ends connected to 
make a doughnut; a sort of “Teacup Torus”. 

In fact, I could act on the solution space X by the 
group S 4 xS 4 via row and columns swaps on any 
solution. I called this group the Automorphism 
group of X. (This terminology is non-standard; 
I chose it purely by analogy with Galois theory.) 
This allowed me to dehne an equivalence relation 
~ on the space of solutions with the equivalence 
classes precisely the orbits of the action: we say x ~ 
y iff there is o e S 4 x S 4 such that x = a(y) where o = 
(ai, a 2 ) acts ony by permuting the rows via o x and 
the columns via o 2 . Having dehned this equiva- 
lence relation, it then became very natural to ask 
if there was a convenient choice of representa- 


tive for each equivalence class. The answer here 
is ‘yes; by permuting rows and columns, I could 
always transform any given solution to the form 


(1.1) 

(2,?) 

(3,?) 

(4,?) 

(2,?) 

? 

? 

? 

(3,?) 

? 

? 

? 

(4,?) 

? 

? 

? 


Moreover, the action has the following highly de- 
sirable property: if x, y e X are distinct solutions 
in the same orbit, then there is a unique o e S 4 x S 4 
such that x = o(y). Hence, each equivalence class 
has size (4!) 2 = 576 and so this gives 6912/576 = 
12 different equivalence classes. This was a ma- 
jor step towards the solution, but I was still not 
satished; 12 was “too big” and I felt that I could 
quotient out more group actions from the solu- 
tion space. 

Again, progress fell silent as I was revising for my 
hnal exams. The hnal piece of the puzzle came 
to me as I walked back to college along Burrells 
Walk one evening early in May. I had been revis- 
ing Galois theory, which got me thinking about 
groups and (inevitably) a certain puzzle involving 
pieces of hne china. I was thinking about how I 
could incorporate permutations of colour into my 
solution; clearly, I had completely exhausted all 
possible permutations of rows and columns and 
so colour permutations stood as the hnal frontier. 
The problem I faced was that in the top-left hand 
corner of the above scheme, colour 1 had already 
been fixed in the second coordinate. It occurred 
to me while I was walking that, actually, this did 
not present a problem : justpermute the remaining 


Smallest Fibonacci number whose digits and digit 
sum are also Fibonacci. Magic number in Blackjack. 
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colours in the second coordinate among themselves. 
By doing so, I could force any solution to assume 
the form 


(1.1) 

(2,?) 

(3,?) 

(4,?) 

(2,2) 

? 

? 

? 

(3,3) 

? 

? 

? 

(4,4) 

? 

? 

? 


Hence, I had the hnal automorphism group: 
Aut(X) = S 4 xS 4 xS 3 , a group of size 3456. It re- 
mained to fmd representatives for the equivalence 
classes. After a bit of searching, they presented 
themselves: 


(1,1) 

(2,3) 

(3,4) 

(4,2) 

(2,2) 

(1,4) 

(4,3) 

(3,1) 

(3,3) 

(4,1) 

(1,2) 

(2,4) 

(4,4) 

(3,2) 

(2,1) 

(1,3) 


(1,1) 

(2,4) 

(3,2) 

(4,3) 

(2,2) 

(1,3) 

(4,1) 

(3,4) 

(3,3) 

(4,2) 

(1,4) 

(2,1) 

(4,4) 

(3,1) 

(2,3) 

(1,2) 


This was exactly what I was expecting: there are 
two equivalence classes resulting from this action. 
Also, the above solutions lie in different orbits, 
which preserves the uniqueness of action. Hence, 
I had 2x3456 = 6912 solutions altogether, which 
agreed with my computer search. 

Designing Experiments 

All of the above discussion has a distinctly pure 
flavour and may have made some of my applied 
readers slightly nauseous. Fear not my friends, we 
now present an application of the above theory. 

Suppose that the brilliant genius Prof. Tarquin 
Walter Kornman is organising examinations for 
his students at Camford University. He is cur- 
rently arranging an exam timetable for his four 
students, Alice, Daniel, Grace and Timothy, who 
must each take four papers. Tarquin aims to de- 
sign a timetable that is as ethcient as possible, but 
considers it a form of cruelty to force a student to 
sit more than one paper per day. However, in spite 
of his compassion, his genius lends him certain 
eccentricities, chief among which is an extreme 
desire to eliminate bias as far as possible. He does 
so by choosing, for each day, four diAerent exami- 
nation start times. Given this information, how 
should Tarquin finish the exam timetable? 


Here, Tarquin has four collections of symbols, or 
treatments , to deal with: Start Time, Exam Date, 
Student Name and Paper Number. It turns out 
that the most efficient and unbiased arrangement 
uses a pair of OLSs of order 4, preferably chosen 
at random. All that Tarquin need do is to specify 
which treatment will label the rows, which will la- 
bel the columns and finally to use the other two 
treatments as sets of symbols with which to fill the 
square. For example, one solution might be to use 
the Student Names as row labels, the Exam Dates 
as column labels and then to fill the table with the 
Paper Numbers and Start Times. A similar strat- 
egy will work for any similar experiment; such 
arrangements are called Pairwise Balanced Designs 
and are frequently used to ensure efficiency and 
elimination of bias when designing an experiment. 

Now, suppose that Tarquin is dissatisfied with the 
way the examinations were run this year. He be- 
lieves that the students had not been examined 
thoroughly enough, so decides to make the exams 
harder by introducing two new papers. By sheer 
luck, two new students join the course the fol- 
lowing year, so that now we have six papers to be 
taken by six students over six days, each at six dif- 
ferent starting times. Tarquin, now knowing that 
the key lies in orthogonal latin squares, proceeds 
to construct a solution. However, after many long 
and painful attempts, he is reduced to a blubber- 
ing wreck on his othce floor, since he is unable 
to find one. He has convinced himself of his own 
stupidity, but he need not be so harsh on himself... 

Euler's Conjecture: 
or,Tarquin redeemed 

Earlier I mentioned the famous “Problĕme des 36 
Othciers” that was first stated by Euler. Like Tar- 
quin, Euler devoted much effort to solving this 
problem, but was also unable to find a solution. 
Euler had already developed techniques to con- 
struct pairs of Orthogonal Latin Squares of order 
n for n odd or divisible by 4. Given the (trivial) 
impossibility of the case n-2 and his unsuccess- 
ful attempts for n - 6, Euler made a bold claim: 
There do not exist a pair of Orthogonal Latin 
Squares of order At + 2, for any t > 0. One can 
only imagine Euler s reasoning for making such 
an extreme claim on the back of two pieces of evi- 
dence; I daresay it smacks of Physicists’ Induction. 

A first cry of success came in 1900, when Gaston 
Tarry published a proof that the case n = 6 is in- 
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Number of partitions of 8. Pentagonal and a centred 
heptagonal number. 22/7 is an approximation for tt. 

























deed impossible. Tarry was born in France and 
moved to Algeria to work as an administrator. Al- 
though an amateur, he had an amazing capacity 
for combinatorial problems. His proof proceeds 
as follows: he began by reducing all order 6 Lat- 
in Squares to 17 types, via careful and painfully 
detailed reasoning about cycle types in S 6 . From 
there, he reasoned that for a solution to exist, 
we must hnd such a Latin Square that possesses 
a complete set of transversals. A transversal is a 
subset of n cells within an nxn square, labelled 
with the symbols 1 to n, such that in each row and 
column of the square there is a cell of the subset. 
At that point, he took a minor detour to give an in- 
teresting study of the n-4 case using transversals. 
Returning to the proof, he managed to reduce to 3 
the number of Latin Squares that need to be con- 
sidered by showing that some cells of some Latin 
Squares cannot be part of any transversal. Finally, 
he considered these three cases in turn, using a 
greedy algorithm which he calls The Method of 
Order to show that no orthogonal mate can exist 
for any of them. While we can be glad that this 
proof resolves the problem, it does lack a certain 
panache; it tells us that no orthogonal mate can 
exist, but doesnt tell us why it cant exist. Indeed, 
even Tarry was disappointed by this; he writes, 
“The method of order, which does not shed any 
light on the problems it resolves, should not be 
used unless we cannot do otherwise; it is a last 
resort”. 

Since Tarrys proof was published, shorter and 
more informative proofs have been found. See, for 
example, Fisher and Yates (1934), Yamamoto 
(1954) or Stinson (1984). Stinsons proof rather 
interestingly highlights a link between Latin 
Squares and coding theory! 

The Fall of Euler's Conjecture 

Following the proof for the case n = 6, the subject 
seemed to have died down slightly. This changed 
in May 1959, when Raj Bose and Sharadchan- 
dra Shrikhande managed to construct a pair 
of Orthogonal Latin Squares of order 22, thus 
disproving Eulers Conjecture. In fact, it didnt 
stop there; shortly after that paper was published, 
Ernest Parker published a paper in which he 
presented an example with n = 10. Eulers Conjec- 
ture was crumbling fast; in fact, he couldnt have 
been more wrong. In one hnal paper published 
by Bose and Shrikhande in 1960, they proved 
that there exist a pair of OLSs of order 4t + 2 for 


inhnitely many t. To add insult to injury, the proof 
didnt even use very advanced techniques; it relied 
on fairly straightforward properties of hnite helds 
and techniques from Combinatorial Design theory. 

The sudden success with which Eulers Conjec- 
ture was disproved sparked a new wave of inter- 
est in Latin Squares. To capitalise on this inter- 
est, Joszef Dĕnes and Donald Keedwell wrote 
a comprehensive volume on Latin Squares in 
1974. It became an instant hit; such was the in- 
terest in Combinatorial Designs that a sequel was 
published 17 years later. Between them, the two 
volumes cover many aspects on the theory and 
applications of Latin Squares and contain no less 
than 4 complete chapters devoted to the idea of 
Orthogonality. Applications included Experimen- 
tal design, Statistics, Error-correcting codes, Al- 
gebra and Geometry, to name only a few. 
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Rrchimedeans Annuol 


Only non-trivial integer n with the property that 
I 2 + ... + n 2 is a pertect sguare, in this case 70 2 . 


1 Dozzling Dice 

You roll a (standard fair six-sided with sides 1 to 6) die an inhnite 
number of times, recording thetotal score attained so far after each 
roll as a sequence. What are the most and least likely numbers to 
appear in this sequence, and with what probabilities do they occur? 


2 SnoppY Surds 


Find all integers such that 


3 Poinful Primes 


Exactly one of the following numbers 
is prime. Which one? 

852,081 1 , 050,589 

967,535 1 , 052,651 

999,917 1 , 073,254 

999,919 1 , 093,411 


is also an integer. 


4 Compelling Convergence 

Determine whether the following series converge or diverge, and determine the value of 
any that converge. 


sin n 1/2 

^ 2 sin n 

2(1 - sin 2 n ) 

n 2 

- J m 2 

n =1 n 

1 - cos4n 










5 Superb Sets 


For each of the following sets, determine whether it is finite, count- 
able or uncountable. Give the explicit sizes of the finite sets, and for 
any uncountable set, determine whether it bijects with R. 

• Group homomorphisms (Z,+) -> (Q,+) 

• Group homomorphisms (Q,+) -> (Z,+) 

• Equivalence relations on Q 

• Sequences in Q that converge to some member of Q 


6 Triumphont Treosures 

The planet Zog has radius 1 has an associated geostationary moon of 
negligible radius. You have followed the evil space-pirate Blackmous- 
tache to this system, in which he has buried his treasure. You know that: 

i. The moon lies a distance X from the planet, with 1 < X < 1.5; 

ii. The centre of Zog is denoted Z; 

--■v iii. The city of Luna lies on the closest point of the planet to the moon; 

iv. The city of Antiluna is antipodal (at the other end of the diameter) 
to Luna; 



7 Curious Coins 


v. The treasure lies at least X away from Antiluna; 

vi. The city of Luna produces so much toxic waste that any point P in 
the planet with a.PZL < a cannot contain the treasure; 

vii. The core of Zog is molten, so the treasure does not lie within it; 

viii. If the core has volume V and surface area A, then 

+ 6^7 cos a > A 2 - 1; 

47r A ' 

ix. If the treasure lies at the poi nt T, then the followinq ine guality 
holds: |TZ| sin (^ TZL) < yj (A 2 - 1) sin 2 a + \ sin 2 (2a) - \ sin(2a) ; 

x. The city of Midi lies exactly halfway between Luna and Antiluna, 
with antipodal city Centra.Then the treasure is known to be in the 
plane containing Luna, Midi and Antiluna, and to be at least as 
close to Midi as to Centra. 

Where is the treasure buried? 


Two players play a gameon an nxn square tableon which coins of diameter 1 are placed 
in turn.The winner is the one who plays the last coin. For which n do you want to play first? 

NB: The coins must hove their centre above the table , must be placed f\at and cannot be stacked. 


Smallest pseudo prime satisfying 7 n = 7 mod n. 
Automorphic, aspiring and non-sociable number 












8 Perceptive Polygons 

Begin with an equilateral triangle of side length 1, and draw 
its circumcircle. About this, circumscribe a square, and then 
draw the circle around this. Repeat this inhnitely many 
times, each time circumscribing a regular n-gon around the 
outermost circle, and then drawing the circumcircle of that, 
forming the new outermost circle. Does this object fit inside 
a circle of radius 100? 



9 Terrible Tricingles 

ln an equilateral triangle with side 
length 1, consider dropping a per- 
pendicular from a vertex onto the 
opposite side. Then, repeat this pro- 
cess, spiralling in clockwise, as in the 
picture. Where (in Cartesian coordi- 
nates, calling the bottom left vertex 
the origin) is the point to which this 
process converges? 



10 Rough Relations 

Let R be a relation that is"anti-transi- 
tive", that is if aRb and bRc, then cRa. 
Then, define f(n ), for n e N, as the least 
m e N such that there exists a set 7, 
with \T\ = n and a,beT=> aRb or bRa, 
so that exactly m unordered pairs 
(s, t) eTx 7have the properties: 


ii. sRtandtRs. 

Find, for all n e N, the value of f(n). 


11 Gorgeous Geometry 

Let C be the mid-point of OD, and let Q lie on the semicircle through D 
with centre C, whose diameter is perpendicular to OD. Points A and B lie 
in the plane of the semicircle, are equidistant from 0 and also from Q.The 
point R completes the rhombus QARB. 

Find the locus of R as Q traverses the semicircle, with the distances OA, OB , 
QB , AR and BR remaining fixed. 
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Only positive number to be directly between a 
square and a cube. Number of sporadic groups. 








12 Mysterious Matchings 

This problem can be solved easily by 5 to 10 year olds: 


1235 ^0 

8738^4 

0000 -> 4 

8317^2 

1101 -> 1 

3275 ^0 

9834 -> 4 

2814 —> 3 

2222 -> 0 

2176 —> 1 

9393 -> 2 

5656 2 

3535 ^0 

8261 -> 3 

7272 -> 0 

0909 -> 4 

9232 -> 1 

7068 -> 4 

1818 —> 4 

7777 -> 0 


3821 r A? 


13 Dozzling Digits 


Hardy and Ramanujan are playing a game, where on each turn, Hardy 
names some digit (which need not be distinct from previous digits), and 
then Ramanujan inserts it into the expression **** - ****, in place of 
one of the stars. 



Down 

1 Equate Rn ions containing 
Hamilton's group. (11) 

Mix paint on hide of Archi- 
medes'cows. (11) 

3 That's sum royal snake! (5) 
6 Also intersection. (3) 


Solutions to the Archimedeans Problems Drive can be found on page 94. 



In the Collaz Conjecture you need 112 steps to 
get from 27 to 1. Sum of the digits of its cube. 


Pf 












Pi in Fours 

John Conway and Michael Guy 


First published in issue 25,1962 

T he famous “four 4s problem” asks you to ar- 
range four 4 s, and any number of the ordi- 
nary mathematical symbols, to give as good 
an approximation to Pi as you can fmd. 


We can also hnd log n in seven 4’s, but as yet we 
have not been able to fmd any formula of this kind 
for Euler s constant y. 

We shall now show that the above devices are un- 
necessary. In fact: 


We shall allow the symbols (,),+,-, x and +, the 
usual notations for roots V and S/, powers, factori- 
als and the decimal notation 44, .4 and .4. Pi itself, 
logarithms and trigonometric hmctions may not 
be used. Fact orials are to be of integers only, oth- 
erwise tt = yj (-+4/ 4 )! 4 . We shall also not allow 
such monstrosities as .\f\. 

For example, 


Theorem 1 Any real number may be approxi- 
mated arbitrarily closely using only four 4 s 
and the usual symbols. 

Proof: It follows from the formula n (\fa - \fb) -* 
log(a / b) that for suthciently large n we have 

2 m < 2f »(42 - ( — i, _ 4 2-<-”)) 

< 2 m + l 



is a very good approximation to e , and can clearly 
be modihed to be as good as we please. It can fur- 
thermore be improved so as to only use three 4 s, 
since, as n oo, n / \fn\ e. 


We may derive similar “explicit” formulae for var- 
ious interesting numbers. Thus n <fa - n log a, 
so that we obtain a sequence of approximations 
to log 2, log 5, and log a fr for a variety of rational 
a and b (e.g. log 10 2 or log 10 3). Our best result of 
this kind for n has seven 4 s, and is derived from 


n 


lim 

n-> oo 


) 4 


for the limit of this expression as n oo is 
2 m log 4, and 1 < log 4 < 2. If now m is any integer 
and n > m, both n - m and n - m - 1 are posi- 
tive, so that we may write the expression above as 
2 n (f n ~ m ~ l A - f n ~ m 4), the indices of the root 
signs indicating repetitions. Taking square roots 
k times, we have 

< 2 ( m+1 )/ 2 \ 

Now we may take n to be of the form 4(\) p so as 
to satisfy all the above conditions, and the the 
expression between the inequality signs will use 
only four 4 s. Since the numbers 2 m/2k for integers 
m and positive integers k are dense in the positive 
real numbers, we have proved our theorem. (For 
a negative number we need merely add another - 
sign.) □ 
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Second pertect number. Sum of the 
totient tunctions of the first nine integers. 












Theorem 2 If we allow use of the integer part 
sign, every integer is representable with four 
fours and every rational number with five. 

The first part is obvious, and the second part be- 
comes a corollary of the first when we note that 
any rational p/q equals m/ 4(!) n for suitable inte- 
gers m and n. □ 

We may modify Theorems 1 and 2 so as to use 
other (positive integral) numbers instead of 4s. 
The only condition is that at most three of these 
maybe ls. 

Finally we pose these questions: 

• Is there an “explicit” formula for n 
with less than seven 4 s? 

• Is there and explicit formula for y? 

• Are the numbers y/ n ( 4(!) m ) dense in x > 1? 



Tetranacci number. Saturn 
requires 29 years to orbit the Sun. 




















W hen Grigori Perelman proved the 1904 
Poincarĕ conjecture in 2003, he gave a 
sketch proof of Thurstons stronger ge- 
ometrisation conjecture , which had been around 
since 1982. Both concern 3-manifolds. But what 
about the geometrisation theorems little sister, 
which deals with 2-manifolds? 

Known as the classijication theorem for closed 
connected sur/aces , this gives a complete list of all 
closed connected 2-manifolds up to homeomor- 
phism, enabling any such surface to be slotted 
into one of two simple categories. 

First announced in 1888, with a proof that turned 
out to be incomplete, the classihcation theorem 
was proved in 1907, albeit assuming triangulabil- 
ity, which was only proved in 1925. 

Nowadays, it crops up in various second and 
third-year Tripos courses, including IB Geometry, 
II Differential Geometry, II Algebraic Topology. 
But only stated, not proved. 


A surface is connected if it s all in one piece; and 
closed if it has no boundary and can be expressed 
as a hnite union of discs. So a cylinder isn t closed, 
and nor is any unbounded surface in 3D, but a 
sphere and torus are. For brevity, we 11 use ‘surface 
to mean closed connected surface’ 

Two surfaces are homeomorphic if there is a con- 
tinuous bijection between them with a continu- 
ous inverse: intuitively, if they are ‘topologically 
equivalent’ in the doughnut-teacup sense. 

We assume all surfaces are triangulable: in other 
words, that any surface is topologically equivalent 
to a polyhedron with flat triangular faces. 

The Euler characteristic is the quantity % = V- E + 
F, where V, E , F are the numbers of vertices, edges, 
and faces of the triangulation. This is an invariant: 
any two triangulations of the same surface have 
the same Euler characteristic. 

And offwe go... 


There is an elementary proof, requiring little more 
than a basic knowledge of triangulations. This was 
first given by Christopher Zeeman in the 1960s. 

Detinitions 

A surface, or 2-manifold, is a topological space 
thats locally homeomorphic to IR 2 . In other words, 
any point has a small neighbourhood which is ap- 
proximately flat. Take for example the surface of 
the Earth: from close up it looks flat, and you need 
to get a long way away to see that it isn t. 


TheTheorem and Proof 

The classiAcation theorem states that any closed 
connected surface S is homeomorphic to one of 
the following: 

• ifits orientable, the sphere with g handles 
glued on, i.e. the g-holed torus, for some g > 0; 

• ififs non-orientable , the sphere with h Mo- 
bius bands sewn in, for some h> 1. 

Gluing on a handle means removing two small 
discs on the sphere and sticking the two edges of 
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lcosahedron and dodecahedron have 30 edges. 
The icosidodecahedron has 30 vertices. 



a hollow cylinder into the gaps. Sewing in a Mo- 
bius band means removing one small disc on the 
sphere and sticking a Mobius band into the gap. 
Recall that a Mobius band has only one edge. 

We 11 need the following two lemmas. 

Lemma 1 The Euler characteristic of any 
surface is at most 2. 

Proof: Note that the Euler characteristic of a 
graph is % = V - E, since a graph has no faces. If 
the graph is a tree, i.e. it has no closed loops, then 
it can be shrunk to a point, so its Euler charac- 
teristic is 1. If it isnt a tree, then removing one 
edge from a closed loop increases the Euler char- 
acteristic by 1, and we can get a tree after hnitely 
many such operations. So for a graph we always 
get*< 1. 

Take a triangulation T of a surface S and consider 
its dual triangulation D, formed by putting a ver- 
tex at the centre of each T-face and a face with 
centre at each T-vertex. Let M be a maximal tree 
in D, dehned as a tree to which no more edges can 
be added without creating a closed loop, and let 
C = D\M. 

Since M is a tree, C is connected. Since M is maxi- 
mal, M contains all vertices of D. So there are bi- 
jections 

{T-triangles} ++ {M-vertices}, 

{C-edges} {D-edges}, 

{C-vertices} ++ {C-vertices}. 

Thereforex(S) = *(M) + x(C) <2. □ 

Lemma 2 If S is a surface which is discon- 
nected by every closed curve on it, then it is 
homeomorphic to the sphere. 

Proof: Let T, D, M, and C be as before. If C con- 
tains a loop, then this loop disconnects S; each 
connected component must contain a D-vertex, 
and any two D-vertices are joined by edges in M. 
Contradiction, so C is a tree. Let X be the set of 
points in S closer to M than to C, and Y be the 
set of points in S closer to C than to M. Each of X 
and Y is a fattening up of a tree, so they are both 
homeomorphic to the disc. But S is just X and Y 
glued together edge-to-edge, so S is homeomor- 
phic to two discs glued edge-to-edge, i.e. to the 
sphere. □ 


We now use the following surgery algorithm on 
an arbitrary surface S. 

1. If S is disconnected by every closed curve 
on it, stop. 

2. If there is a non-disconnecting closed 
curve on S, remove a thin strip around this 
curve; this strip must be a cylinder or a 
Mobius band. 

3. If the strip is a cylinder, glue in two discs 
to the gaps left in S, increasing^(S) by 2, 
and mark both of them with an orientation 
(clockwise or counterclockwise) so that 
they agree along the cylinder. 

4. If the strip is a Mobius band glue in 1 disc 
to the gap left in S, increasing ^(S) by 1. 

5. Go to 1. 

By Lemma 1, the process stops after hnitely many 
steps. By Lemma 2, the surface we get when it 
does stop must be a sphere. 

Now start from a sphere and reverse the process 
to get to S in hnitely many steps. In each step, we 
have three possibilities for what needs to be re- 
placed: 

a. 1 disc; 

b. 2 discs with different orientation (one 
clockwise, one counterclockwise); 

c. 2 discs with the same orientation (both 
clockwise or both counterclockwise). 

If it is (a), were sewing in one Mobius band. If it 
is (b), were gluing in a handle. If it is (c), were 
sewing in a Klein bottle. But a Klein bottle is just 
two Mobius bands, so we can ignore (c) without 
loss of generality. 

So we can get to any surface S by starting with a 
sphere and putting in hnitely many Mobius bands 
and handles. 

If S is orientable, then it can t contain any Mobius 
strips, so it s a sphere with hnitely many handles. 

If it is non-orientable, then we must sew in at least 
one Mobius band. If we ever glue in a handle, then 
we can transport one of the two differently-orient- 
ed discs around this Mobius band so that theyve 
both got the same orientation. This reduces (b) 
to c), which weve seen reduces to (a). So S is a 
sphere with hnitely many Mobius bands. And this 
completes the proof. □ 


Number of musical triads (12 major, 12 minor, 4 
diminished, and 3 augmented). 11111 in base 2. 
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by combining the images of this segment under 
four transformations, each involving a dilation 
of factor 1/3 composed with either or both a ro- 
tation and translation. Combining the images of 
K x under the same four transformations yields 
K 2 , and the Koch curve itself (KJ is the limit 
of this process as it is iterated. (Peitgen, in [4], 
calls this method of drawing fractals the “Mul- 
tiple Reduction Copy Machine” or MRCM.) 

We can see that this object is self-similar, in the 
sense that we can hnd arbitrarily small por- 
tions of the curve that are related to the whole 
by a similarity transformation. 

The fractal fern of Figure 1 is also the limit 
of four alhne transformations iterated in the 
same manner. Since each athne transformation 
may be represented by a 2x2 matrix giving the 
homogenous part of the transformation and a 
2-component vector giving the inhomogeneous 
(translation) part of the transformation, a hg- 
ure that is the limit of n iterated ahine transfor- 
mations can be encoded as a collection of 6 n 
real numbers - a much more ethcient encod- 
ing than a pixel-by-pixel representation. These 
ideas also generalise in an obvious manner to 


First published in issue 57,2005 


U of the study of fractals is that of their 
use in image compression. Such an 
application is not surprising, since seemingly 
complicated and intricate fractal images have 
relatively simple mathematical descriptions 
in terms of iterated mappings. Given also that 
fractals have been found to model well a wide 
variety of natural forms, it seems natural that 
we should try to exploit their self-similar prop- 
erties to encode images of such forms. 

We examine a simple example of a fractal, the 
Koch curve, to illustrate the principle of encod- 
ing a fractal image. Referring to Figure 2, we 
construct the Koch curve by hrst taking a line 
segment of length 1, K 0 . We then construct K x 


Figure 1 Real and 
computer generated Ferns 










sonable approximation of the limiting set. More 
theoretically, we might question whether we can 
be sure that such iterations will indeed tend to a 
dehnite limit, and, given that any such limit will 
be invariant under the iteration, whether it matters 
with what set we start. Could we, for example, have 
begun our construction of the Koch curve with a 
circle rather than a line segment? 

In this article, we shall see that, by considering sub- 
sets of Euclidean space as points in a metric space, 
we can measure how different two images are, and 
by applying the contraction mapping theorem, we 
can see that limit sets of the sort described above 
do exist, that our starting point in their construc- 
tion does not matter, and we can also obtain an 
estimate for how rapid the convergence is. 




Detinitions 

For reference, we enumerate here a few stand- 
ard dehnitions and theorems that we shall use 
later. 

Dehnition 1 A metric space is an ordered 
pair (X, d), where Xis a set and d:XxX^R 
is a function with the following properties: 

(i) d{x,y) > 0 V x,y e X, 

with d{x,y) = 0 if and only if x = y; 

(ii) d(x,y) = d(x,y) V x,y e X; 

(iii) d(x,z) < d(x,y) + d(y,z) V x,y,z e X. 

The notion of convergence of a sequence to a 
limit carries over to metric spaces in an obvi- 
ous way, as does the following related notion: 


Dehnition 4 Let (X,d) be a metric space. 
Then/: X -> X is a contraction if there ex- 
ists a non-negative real number c < 1 such 
that d(f(x),f(y)) < c x d(x,y) for all x,y e A 7 . 

Our central theorem tells us about the be- 
haviour of contractions under iteration (for a 
proof, see, for example, [3]). 

Theorem 5: Contraction Mapping 

Let (X, d) be a non-empty complete 
metric space and/: X ^ X a contraction. 
Then there exists a unique x 0 e X such 
that f(x 0 ) = x 0 , and furthermore, 
hm n _ >00 / n (x) = x 0 for all x e X. 

In the hnal section we will refer to a corollary: 


Dehnition 2 Let (x n ) be a sequence of points 
in a metric space (X, d). We say that (x n ) is 
Cauchy if, given e > 0, there exists an N e N 
such that for all m,n>N, d(x m ,x n ) < e. 


Clearly every convergent sequence is a Cauchy 
sequence. The converse is also true for an im- 
portant class of metric spaces: 


Dehnition 3 A metric space (X,d) is 
complete if every Cauchy sequence in A 7 
converges. 


We remark that the metric space formed by R n 
with the usual Euclidean metric is complete. 


The important properties of compact sets 
which we need are that they are closed and 
bounded. 


For most of the time, we shall restrict our at- 
tention to compact subsets of metric spaces. 


Dehnition 7 Let (X,d) be a metric space. 
Then we say A<z X is compact if every 
covering of A by open sets has a hnite 
subcovering. 


Corollary 6 Let (X,d) be a non-empty 
complete metric space and f: X ^ X such 
that f n is a contraction. Then the same 
conclusions hold as for Theorem 5. 















Hausdorff Distance 

Our starting point is a way of turning a collec- 
tion of subsets of Euclidean space into a com- 
plete metric space, so that we can talk about 
limits and convergence, and make use of the 
considerable information provided by the con- 
traction mapping theorem. The concept we 
require is due to HausdoriT, who formulated 
a notion of ‘distance’ between compact sub- 
sets of a metric space which makes the set of 
compact subsets of a given metric space into a 
metric space itself. Furthermore, if our initial 
metric space is complete, then so is the space 
of compact subsets with the Hausdorff metric. 

We require a further concept before introduc- 
ing the Hausdorff distance itself: 

Dehnition 8 Let A be a subset of a metric 
space (X, d ). The £-collar of A, denoted A e , 
is the set {x e X : 3 a e A with d(a,x) < e}, 
i.e. the set of all points at a distance at most 
e from the set A. 


Dehnition 9 Let A and B be compact 
subsets of a metric space (X, d ). If we write 
p'(A,B) = inf{c > 0 : A c B £ } then the 
Hausdorff distance p (A, B) between A and 
T, is dehned by p (A, B) = max{p' (A, B ), 
p'(B,A)}. 

It follows straightforwardly from the dehnition 
that p' satishes all the axioms for a metric space 
in dehnition 1 except (ii), so the hnal part of 
the dehnition is essentially a symmetrisation. 
An alternative dehnition sometimes used (for 
example in [3]) but which does the same job is 
p (A, B) = p' (A, B) + p' (T, A). The proof that the 
resulting metric space inherits completeness is 
given in [2] and as an exercise in [3]. 


The Hutchinson Operator 

Now that we have some way of measuring 
closeness’ of compact subsets of metric spaces, 
our next task is to show that the iterated trans- 
formation applied in Figure 2 to construct the 
Koch curve is indeed a contraction, so that we 
may apply Theorem 5. The following treatment 
follows quite closely that of [4]. We work in R m . 

We have a collection of athne transformations, 
T l5 T 2 ,..., T n , andat eachiteration we applythe 
transformation 

T : A ^ \J T z A. 

i =1 

(This is known as the Hutchinson operator, 
affer Hutchinson who hrst analysed its prop- 
erties.) We impose the condition that each 
should itself be a contraction with respect to 
the Euclidean metric, with constant q < 1. 

We now show that T is a contraction with 
constant c = ma^lc^, c 2 , ..., c n } on the met- 
ric space of compact subsets of R m equipped 
with the Hausdorff metric. (See diagram below 
for the following.) Let A and B be compact 
subsets of R m with p'(B,A) = S. Then for any 
e > 8 we have B c A e . Clearly then T t B c T { A £ , 
for each /, but since T t is contractive on R m , 
T { A £ c (T x A) £ ., where £ { = c { £ < c£. Hence T { B c 
(TiA) e . c (T x A) C£ , yielding 


(jTiBoU(TiA) c 

i =1 i =1 


So TB c (TA) C£ for all £ > d, and hence p' (TB , TA) 
< c8. Therefore p(TB,TA) < c x p(A,B) and so 
T is indeed a contraction. 


(M) 


(TjA) 














works for any metric space, not just R m with 
the Euclidean metric, we may certainly replace 
the Euclidean metric in the above analysis with 
any other making M m into a complete metric 
space, to be able to draw conclusions about the 
convergence properties of a wider yariety of 
Hutchinson operators. 


Julia Sets 


We conclude with some brief, informal re- 
marks about how these ideas may be applied 
to producing images of another rather famous 
class of fractals. For a given polynomial/: C -> 
C, the Julia Set of/, /(/), is the closure of the 
set of repelling (unstable) fixed and periodic 
points of / This is non-trivially equivalent to 
the definition as the boundary of the basins of 
attraction of the attractive fixed points of/(for 
details see [1]), and the set J(f ) has the prop- 
erty that /(/) = / _1 (/) = /. The most famous 
example of these objects are those associated 
with the mapping /: z z 2 + c for various 
c g C (like the example shown in Figure 3). In 
this case we notice that the inverse mapping 


An important practical observation which can 
be made from the above proof is that the con- 
traction constant calculated for T is equal to 
the largest of the individual contraction con- 
stants of the transformation T f . It is clear from 
the proof that, in general, we can do no better 
than this. In the usual proof of the contraction 
mapping theorem, it is shown that, for a con- 
traction/with constant c, 

d(r{x),r k { x ))<d{xj{x))f-. 

Since this inequality holds for all k, the expres- 
sion c n /(l - c) provides an estimate for how 
quickly the iterations converge to the unique 
fixed point. As might be expected, we see that 
the larger the constant c, the slower the con- 
vergence. Hence the MRCM method of draw- 
ing fractals is only as rapid as is allowed by the 
Teast contractive contraction. It is, however, 
worth remarking that a given transformation 
may or may not be contractive, depending on 
the choice of metric, and that the contraction 
constants will vary according to the metric 
used. Since the notion of Hausdorff distance 









f- l -z»{±VT^c} seems to play the role of a 


non-linear Hutchinson operator, in that each 
point (other than c itself) has two images, and 
the fractal of interest is invariant under the 
transformation. 


repelling periodic point of/(whose period we 
shall denote by p), which in turn will be an 
attractiye periodic point o: z ^ +y/z - c 


; i . M 


We might well then ask whether the mapping is 
contractive. Here a partial answer is suggested 
by the theory of conformal mappings, which 
tells us that for a conformal mappingg: C -> C, 
the approximate scaling in length near a point 
z 0 in C is \g'(z 0 )\. The criterion for a fixedpoint 
z 0 of a mapping g to be attractive, viz. \g\z 0 ) \ < 
1, is therefore the same as the criterion for the 
mapping to be locally contractive. Any point 
close to J(f) is, by definition, close to some 


and/ 2 _1 : z •-» -\fz^c. Hence the iterate Tp 
of the Hutchinson operator T defined by these 
two mappings will be a local contraction, and 
so Corollary 6 suggests that, at least if we con- 
sider sets not c too far’ in terms of Hausdorff 
distance from /(/), the iteration will converge 
in the same manner as for the self-affine frac- 
tals discussed above. In fact the convergence is 
very good, and although after a finite time the 
iterates do not in general approximate all parts 
of the Julia set evenly, this is how many fractal 
packages produce their images of Julia Sets. 


* * 
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Stein's Paradox 

Dr Richard J. Samworth, Statslab Cambridge 


P erhaps the most surprising result in Statis- 
tics arises in a remarkably simple estima- 
tion problem. LetX 1} .. X p be independent 
random yariables, with X t ~ 1) for i = 1, ..., 

p. Writing X = (X x ,..., X p ) r , suppose we want to 
bnd a good estimator 0 = 0(X) of 0 = (0 V ..., 6 p ) T . 
To debne more precisely what is meant by a good 
estimator, we use the language of statistical deci- 
sion theory. We introduce a loss function L(0,0 ), 
which measures the loss incurred when the true 
value of our unknown parameter is 6> and we esti- 
mate it by 0. We will be particularly interested in 
the squared error loss function L(0 , 0) = || 0 - 0|| 2 , 
where || • || denotes the Euclidean norm, but other 
choices, such as the absolute error loss L(0,0) = 
EfLi \0i - 0j\ are of course perfectly possible. 

Now L(0 y 0) is a random quantity, which is not 
ideal for comparing the overall performance of 
two different estimators (as opposed to the loss- 
es they each incur on a particular data set). We 
therefore introduce the risk function 

R(0,0) =E{L(6,6)}. 

If 0 and 0 are both estimators of 0, we say 0 
strictly dominates 0 if R(6,6) < R(6,6) for all 0, 
with strict inequality for some value of 0. In this 
case, we say 0 is inadmissible. If 0 is not strictly 
dominated by any estimator of 0 , it is said to be 
admissible. Notice that admissible estimators 
are not necessarily sensible: for instance, in our 


problem above with p = 1 and the squared error 
loss function, the estimator 0 = 37 (which ignores 
the data!) is admissible. On the other hand, deci- 
sion theory dictates that inadmissible estimators 
can be discarded, and that we should restrict our 
choice of estimator to the set of admissible ones. 

This discussion may seem like overkill in this 
simple problem, because there is a very obvious 
estimator of 0: since all the components of X are 
independent, and E(X f ) = 0 { (in other words X f 
is an unbiased estimator of 0 f ), why not just use 
0°(X) = X? Indeed, this estimator appears to have 
several desirable properties (for example, it is the 
maximum likelihood estimator and the uniform 
minimum variance unbiased estimator), and by 
the early 1950’s, three proofs had emerged to show 
that 0° is admissible for squared error loss when 
p = 1. Nevertheless, Stein (1956) stunned the sta- 
tistical world when he proved that, although 0° is 
admissible for squared error loss when p = 2, it is 
inadmissible whenp > 3. In fact, James and Stein 
(1961) showed that the estimator 

strictly dominates 0°. The proof of this remark- 
able fact is relatively straightforward, and is given 
in the Appendix. 





One of the things that is so surprising about this 
result is that even though all of the components 
of X are independent, the /th component of 6 JS 
depends on all of the components of X. To give 
an unusual example to emphasise the point, sup- 
pose that we were interested in estimating the 
proportion of the US electorate who will vote for 
Barack Obama, the proportion of babies born in 
China that are girls and the proportion of Britons 
with light-coloured eyes. Then our James-Stein 
estimate of the proportion of democratic voters 
depends on our hospital and eye colour data! 
The reader might reasonably complain that in 
the above examples, the data would be binomi- 
ally rather than normally distributed. However, 
one can easily transform binomially distributed 
data so that it is well approximated by a normal 
distribution with unit variance (see the baseball 
example below), and then consider the estimation 
problem on the transformed scale, before apply- 
ing the inverse transform. 

Geometrically, the James-Stein estimator shrinks 
each component of X towards the origin, and it 
is therefore not particularly surprising that the 
biggest improvement in risk over 6 ° comes when 
1 6 1 is close to zero; see Figure 1 for plots of the 
risk functions of 6 ° and 6 JS when p = 5. A simple 
calculation shows that R(6 JS , 0) = 2 for all p > 2, 
so the improvement in risk can be substantial 
whenp is moderate or large. In terms of choosing 
a point to shrink towards, though, there is noth- 
ing special about the origin, and we could equally 
well shrink towards any pre-chosen 6 0 e W using 
the estimator 

In this case, we have R(6 J e s Qy 6-6 0 ) = R(6 Js y 6) y so 
6 e s 0 still strictly dominates 6 ° when p > 3. 

Note that the shrinkage factor in 6 e s 0 becomes 
negative when ||X - 0 O | 2 < p ~ 2, and indeed it 
can be proved that 6 e s is strictly dominated by the 
positive-part James-Stein estimator 


where x + = max (x y 0). The risk of the positive- 
part James-Stein estimator 6 JS = 6+ s 0 is also in- 
cluded in Figure 1 for comparison. Remarkably, 
even the positive-part James-Stein estimator is 
inadmissible, though it cannot be improved by 
much, and it took until Shao and Strawderman 
(1994) to hnd a (still inadmissible!) estimator to 
strictly dominate it. 

Generalisations and 
Related Problems 

It is natural to ask how crucial the normality and 
squared error loss assumptions are to the Stein 
phenomenon. As a consequence of many papers 
written since Steins original masterpiece, it is now 
known that the normality assumption is not criti- 
cal at all; similar (but more complicated) results 
can be proved for very wide classes of distribu- 
tions. The original result can also be generalised 
to different loss functions, but there is an im- 
portant caveat here: the Stein phenomenon only 
holds when we are interested in simultaneous es- 
timation of all components of 6 . If our loss func- 
tion were L(6 y 6) = (6 x - 6^) 2 y for example, then 
we could not improve on 6 °. This explains why it 
wouldn t make much sense to use the James-Stein 
estimator in our bizarre example above; it is in- 
conceivable that we would be simultaneously in- 
terested in three such different quantities to the 
extent that we would want to incorporate all three 
estimation errors into our loss function. 

Although Steins result is very clean to state and 
prove, it may seem somewhat removed from 
practical statistical problems. Nevertheless, the 
idea at the heart of Steins proposal, namely that 
of employing shrinkage to reduce variance (at 
the expense of introducing bias) turns out to be 
a very powerful one that has had a huge impact 
on statistical methodology. In particular, many 
modern statistical models may involve thousands 
or even millions of parameters (e.g. in microar- 
ray experiments in genetics, or fMRI studies in 
neuroimaging); in such circumstances, we would 
almost certainly want estimators to set some of 
the parameters to zero, not only to improve per- 
formance but also to ensure the interpretability of 
the htted model. 
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Sum of five consecutive primes (3 + 5 + 7+11 +13) 
and the first three powers of three (3 + 9 + 27). 
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▲ Figure 1: Risks with respect to squared error loss of 
the usual estimator 6°, the James-Stein estimator 6 JS and 
the positive-part James-Stein estimator 6f when p = 5. 

Another important problem that is closely related 
to estimation is that of constructing a conhdence 
set for 6 , the aim being to give an idea of the un- 
certainty in our estimate of 6. Given a e (0,1), 
an exact (1 - a)-level conhdence set is a subset 
C = C(X) of W such that, whatever the true value 
of 6 , the conhdence set contains it with probability 
exactly 1 - a. The usual, exact (1 - a)-level conh- 
dence set for 6 in our original normal distribution 
set-up is a sphere centred at X More precisely, it is 

C°(X) = {9€KP:||S-X|| 2 <4(a)}, 

where xp( a ) denotes the upper a-point of the 
Xj distribution (in other words, if Z ~ xp> 

F \Z>xl(cc)} = a). But in the light of what we have 
seen in the estimation problem, it is natural to 
consider conhdence sets that are spheres centred 
at 0+ (or O+^eo, for some 6 0 € R p ). Since the distri- 
bution of |- 6 1| 2 depends on \\9\\, we can no 
longer obtain an exact (1 - a)-level conhdence set, 
but it may be possible to construct much smaller 
conhdence sets - using bootstrap methods to ob- 
tain the radius, for example - which still have at 
least (1 -a)-level coverage (e.g. Samworth, 2005). 

A baseball data example 

The following example is adapted from Sam- 
worth (2005). The data in Table 1 give the base- 
ball batting averages (number of hits divided by 
number of times at bat) of p = 9 baseball players, 
all of whom were active in 1990. The source was 
www.baseball-reference.com. For i = 1, ...,p, let n { 
and Z z respectively denote the number of times at 
bat and batting average of the i th player during 


Player 


Z, 


Baines 

415 

0.284 

0.289 

Barheld 

476 

0.246 

0.256 

Bell 

583 

0.254 

0.265 

Biggio 

555 

0.276 

0.287 

Bonds 

519 

0.301 

0.297 

Bonilla 

625 

0.280 

0.279 

Brett 

544 

0.329 

0.305 

Brooks Jr. 

568 

0.266 

0.269 

Browne 

513 

0.267 

0.271 


▲ Table 1: Table showing number of times at 
I bat n ;/ batting average Z, in 1990, and career 

batting average tt■„ of p = 9 baseball players. 

the 1990 season. Further, let 7t z denote the player s 
true batting average, taken to be his career batting 
average. (Each player had at least 3000 at bats in 
his career.) We consider the model where Z v ..., 
Z p are independent, with Z z ~ nf l Bin(n z , n t ). 

We make the transformation 

Xi = y/n~i sin -1 (2Z; - 1), 

and let 6 { = y/n~i sin -1 (27 T z - 1), which means that 
X z is approximately distributed as N(6 iy 1). A heu- 
ristic argument (which can be made rigorous) to 
justify this is that by a Taylor expansion applied 
to the function g(x) = y^nj sin -1 (2x - 1), we have 

x, - 0i =g(Zi) -g(jti) *g'(7ti)(Zi - 71 i ) 

_ \/nj(Zi_ ~ Ttj) 
yjlli(l-ni) 

and this latter expression has an approximate 
N( 0,1) distribution when n { is large, by the cen- 
tral limit theorem. In fact, since min z n { > 400, 
an exact calculation gives that the variance of 
each X ; is between 1 and 1.005 for 7r,- € [0.2, 0.8]. 
For our prior guess 6 0 = (0 o ,i> •••> 0 o ,p) T > we take 
6 0 i = \/h sin _1 (27r 0 - 1), with 7T 0 - 0.275 and 
n =p~ l Y?i =i n { . We hndthat ||X-0|| 2 = 2.56, some- 
what below its expected value of around 9, though 
since the variance of a xi random variable is 18, 
this observation is only around 1.5 standard de- 
viations away from its mean. On the other hand, 
\\0+,o 0 - 6\\ 2 = 1.50, so Stein estimation does pro- 
vide an improvement in this case. 
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Letting n = (n ly ..., n p ) and recalling that 6 is a 
function of 7r, the usual 95% conhdence set for n 
is 

{ne [0,lY : \\X-6\\ 2 < 16.9}. 

On the other hand, the 95% conhdence set for n 
constructed using the bootstrap approach is 

{n e [0,1]^ : ||0^0 o (X) - 6\\ 2 < 12.5}. 

Numerical integration gives that the volume ratio 
of the bootstrap conhdence set to the usual con- 
hdence set in this case is 0.26, so the benehts of 
having centred the conhdence set more appropri- 
ately are quite substantial. 
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Appendix 

First note that since || X-9 1| 2 ~ we have R(9‘\ 9) = p for all 9 € R p . To compute the risk of the James- 
Stein estimator, note that we can write 


R(9 JS ,9) =E 


X-9- 


(P~2)X 

ll*ll 2 


= P-2(P-2)^e| 


X,(x, - 0j) 

\\ X \\ 2 


(p - 2)2E (w)' 


Consider the expectation inside the sum when i = 1. We can simplify this expectation by writing it out 
as an n-fold integral, and computing the inner integral by parts: 


E- 


llxp 


r 00 r 00 x\ 

J- 00 ) -00 || x | 2 

/ OO /• OO II /y 11 2 

... / LI_ 

-00 J —00 \\x 




- 2 x\ 


(2 n)Pl 2 


I 2 dx 1 ... dx p 


since the integrated term vanishes. Repeating virtually the same calculation for components i = 2, ..., p, 
we obtain 


S E 


(^(^- 0,01 f [°° I 

m °° ( \\ x \\ 2 - 2 x 2 \ 

1 ||xp 

-oo{ ||*|| 4 j 


r)P/< 


Jl dx 1 ... dx p 




We therefore conclude that 

R (e JS ,e) = p-(p-2)E^jXfj<p 

for all 6 € ®P, as required. 


Number of Mozart's last symphony. The 
polynomial n 2 + n + 41 gives primes for \n\ < 40. 
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First published in issue 36,1973 

O ne shuffles a pack of cards by dividing 
it into two portions and merging them 
onto a flat surface by riffling with the 
hngertips. The problem that concerns us is: How 
many times do we have to do this before the pack 
is perfectly shujfled , and what exactly is the best 
method ofshuffling? A pack of cards is said to be 
perfectly shuffled if 

(i) either all possible decks are equally likely 
(this is what we require for patience), 

(ii) or all possible deals of the deck into hands 
are equally likely (this is what we require for 
Bridge). 

The objective of this article is to hnd an explicit 

solution of the problem in case (i) and give some 
guidelines in case (ii) which may enable some in- 
terested readers to solve the problem in that case 
also. We will start by proving some subsidiary re- 
sults, after noting that any sequence of shuffles of 
a pack of N cards can be regarded as a random 
permutation of the integers from 1 to N. 

Lemma 1 If D(i) for i- 1,..., k is the set of 

decks which can arise from shutlling a given 

pack once, and if there exists x t > 0 such that 

Z?=i x { = 1, then there exists a shuffling strategy 

under which P (D(i)) =x { . 

Proof: Dehne P ;L (a) = P(a,L | a), where a is a 
sequence of the form (L, R, R, L, R, L, ...) telling 
us whether each of the hrst j - 1 cards fell from the 
right or the left. Then the shutlling strategy is de- 


hned by the P ; I (a). Set S(a) be the subset of D(i) 
which corresponds to shuffles beginning with a 
(the reader should satisfy himself that the obvious 
correspondence between shuffles and decks is a 
bijection). Then we set 

PjAoc) = £i:D(i)65(a - L)Xi . 

T,i:D(i)eS(oc) * (i) (ii) * * * * * * * x i 

A simple inductive check shows that this is the re- 
quired strategy. □ 

Lemma 2 There is a bijection between decks 
obtained by shuffling a new pack of N cards m 
times or less, and sequences a { for i = 1 , ...,N 
satisfying 

(i) 1 < < N; 

(ii) a { ^ a ; - whenever i ^ j ; 

(iii) a { can be expressed as the union of p 
subsequences b k Lp all of which satisfy 
b k Li -b k Li=l = 1, wherep < 2 m . 

Proof: Label each card of the new deck with an 
integer from 1 to N, starting from the top and 
working down. Then any deck after m shuffles can 
be represented as a sequence satisfying (i) and (ii). 
Furthermore, the 2 m subsets of the deck, such that 
two elements of the same set were in the same 
portion of the pack after every cut (some sets of 
which may be empty), correspond to the subse- 
quences b Li , for a shuffle cannot change the order 
of such a subset. 

Conversely, suppose we are given the subsequence 
b Li . We may assume without loss of generality that 
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each sequence b\. is the maximal subsequence 
satisfying (iii) and containing b k Lp otherwise one 
would concatenate subsequences until this were 
true. 

Dehne a vector \(k) with [log 2 p] components, 
such that v z (A;) is the coehicient of 2 Z_1 in the bi- 
nary expansion of k - 1. Assume also without loss 
of generality that b\. is the least number not equal 
to one of U jti 1 b{. by ordering the subsequences 
in a canonical manner. 

Then corresponding to this sequence a { we dehne 
a sequence [log 2 p] shuthes. The sequences b\. are 
all in the top half at the j’th cut if v N x _j{k) > 0 
and are all in the bottom half otherwise. A sim- 
ple inductive argument (left to the reader) shows 
that such a sequence of shuffles exists and is 
unique. Furthermore, since p is not greater than 
2 m , [log 2 p] it is not greater than m. □ 

We now proceed to dehne a sequence of optimal 
shuffling procedures P v P 2 > •••> P n suc h that, if 
{D n (/), i - 1, ..., k n } is the set of all decks which 
can be obtained by shuffling a new deck n times or 
less, then all the D n {i) are equally likely after the 
successive application of P ly ..., P n . We dehne P n 
inductively: let P v ..., P n _ 2 be perfect shuffles and 
let M n (j) be the number of elements of D n _ x (i) for 
i- 1,..., k n _ x , which can give rise to D n (i) after a 
single cut and shuffle. Let {D n (k C j ), i = 1,..., L n C } 
be the set of all decks obtainable by a sequence 
of n shuffles in which C was the last cut. The we 
dehne the probability that the cut in P n is C to be 

Ln,C 1 

y . 

U M„(J) 


By applying Lemma 1 we can construct a shuffling 
strategy such that 

P(cut = C, deck = D n (k ( J) = 1 ■ 

kn M n (k c ,j) 

Summing over all possible cuts we get P(deck = 
D n (k c ,j)) = 1 /k„. Thus we have dehned P n . The 
reader might care, as an instructive exercise, to 
work out P v the optimal procedure for the hrst 
shuffle. 

It is now easy to see that the smallest number of 
shuffles necessary to randomise a pack of N cards 
completely is [log 2 N]: we consider the deck in 
which the original order of the pack is reversed 
and apply Lemma 2 to see that D n (i ), case P x P 2 
... P n , will perfectly shuffle the pack. This is a nice 
result, and what one would expect from informa- 
tion theory. 

The corresponding result for case (ii) mentioned 
in the introductory paragraph would be that one 
requires [Tog 2 M] shuffles, where M is the number 
of players. Unfortunately the beauty of this result 
is spoiled by the fact that it is false - the true value 
is [log 2 /(M,N)], wher ef(M,N) is the least/such 
that for any M hands of N/M cards H) for j = 1, 
...,N/M and i = 1, ..., M, there exists a sequence 
a { of N integers as dehned in Lemma 2, with the 
number of subsequences b k less than or equal to 
f(M,N) and also 

N/M N -1 

U H) = u 

;'=i j=o 

Unfortunately it is not clear how f(M,N) can be 
evaluated; all that is clear is that M < f(M,N) < 
N. Perhaps one of our readers would care to earn 
himself a place in the hearts of Bridge players eve- 
rywhere by solving this problem? 








First published in issue 16,1953 

M ost mathematicians know the theory of 
the game of Nim, described in books 
on mathematical recreations. But few 
seems to be aware of Dr P. M. Grundy s remark- 
able generalisation, published in Eureka 2 in 1939. 
Consider a game T in which 2 players move alter- 
nately, and the last player wins (moving to a “ter- 
minal position”). Dehne inductively a function 
G(P) of the position P as follows: 

(a) if P is terminal, G(P) = 0; 

(b) if there are permitted moves from P to Q, 
from P to P, from P to S, and so on, then 
G(P) is the least non-negative integer differ- 
ent from all of G(Q), G(P), G(S) y ... 

It follows that if 0 < r < G(P) there is a move from 
P to some P with G(P) = r, but no move to any 
position U with G(U) = G(P). If positions P with 
G(P) = 0 are called “safe,” the winning strategy is 
to move always to a safe position: either this is 
terminal, and wins immediately, or the opponent 
moves to an unsafe position and the cycle repeats. 

Now imagine the players engaging in a “simulta- 
neous display” of k games I\, r 2 , ..., T k of this sort, 
the rule being that each player in turn makes a 
move in one and only one game, or if he cannot 
move in any game he loses. Let P v P 2 , ..., P k be 
the positions in the respective games T v r 2 , ..., T k . 
Then Grundy s Theorem states that 


For no player can gain any advantage by mov- 
ing so as to increase any G(P 5 ), as the opponent 
can restore the status quo. And if only decreases 
in G(P S ) are considered, the game is identical 
with Nim, thus proving assertion (i). Therefore 
G(P) = g if and only if the combined position 
(P, P') is safe, where G(P') = g. From that (ii) fol- 
lows fairly readily. 


It follows that we can analyse any such combined 
game completely, provided that we can hnd the 
G(P S ) for the component positions. Nim is an 
example; a heap H x of x counters constitutes a 
component position, since each player in turn 
alters one heap only, and G(H X ) = x. Many vari- 
ants of Nim are similarly analysed. Less trivial is 
Grundy s game, in which any one heap is divided 
into two unequal (non-empty) parts. Thus heaps 
of 1, 2, are terminal, with G = 0, a heap of 3 can 
only be divided into 2+1, which is terminal, so 
G(H 3 ) = 1. Generally G(H X ) in Grundys game is 
the least integer > 0 different from all nim-sums 
of G(H y ) and G(H x _ y ) for 0 < \x. The series goes 


x =0123456789 
G(H X ) = 0001021021 


generally, the G function of the 
combined position is the “nim-sum” of the 
separate G(P S ), i.e. obtained by writing the 
G(P S ) in the scale of 2 and adding columns 
mod 2. 


(i) this combined position is safe if and only if 
k heaps of G(PQ, G(P 2 ), ... y G(P k ) counters 
respectively form a safe combination in Nim, 


continuing with 0, 2,13, 2,1, 3, 2, 4, 3, 0, 4, 3, 0, 4, 
3, 0, 4, 1, 2, 3, 1, 2, 4, 1, 2, 4, 1, 2, ... This curious 
“somewhat periodic series” seems to be trying to 
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have period 3, but with jumps continually occur- 
ring. Richard Guy conhrmed this up to x = 300. 
He suggested that it might be played on a piano, 


taking 0 to be middle C, 1 = D, 2 = 
meaning then became eyident: 


E, etc. The inner 



Di vi ding helps in different parts, different parts. Not the easiest of the arts: No, no, no! (Just you try.) No, no, no! 


Guy also worked with rows R x of * counters, in 
which certain sets of consecutive counters could 
be extracted (thus possibly leaving two shorter 
rows, one each side of the extracted set). In his “*6” 


game, any one counter can be removed, except an 
Ri (= a single counter standing on its own). The 
G(R X ) series (x = 1, 2, ...) is a waltz. (Note that 
some notes span two bars.) 



If I’m al one, all on my own, there I must al ways stay. But if I touch a no ther such, 



I may be ta ken a way. And as a boon, this lit tle tune shows you the right move to play. 


But at this point the tune completely broke down. that. He said, “Yes, an error I made in the calcula- 

I asked Guy if he could think of any reason for tion.” After correction the waltz proceeds: 



This series still quite baffles me. The gener- term I cannot see. P’haps it just wanders a long aimlessly. 


This tries to be periodic with period 26, but jumps 
keep appearing. Many other such games give 
tuneful, somewhat periodic series, for no evident 
reason. Guy discovered two curious exceptions: 
his “*4”, remove 1 counter not at the end of a row, 
has exact period 34 for x > 54, and Kayles, remove 


1 or 2 adjacent counters, has exact period 12 for x > 
71. Thus these games have a complete analysis. But 
generally it might be helpful to bring in a profes- 
sional musician to study number theory. Perhaps 
a thorough study of the Riemann Hypothesis will 
uncover the Lost Chord. After all, why not? 


Kaprekar number, since 45 2 = 2025 and 20 + 25 = 45. 
Triangular, hexagonal and 16-gonal number. 
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The Logic of Logic 

Zoe Wyatt, Newnham 


I n the early 20 th century mathematicians em- 
barked on a quest to hnd a secure foundation 
for their subject, based on the use of axioms 
and rigorous logic. 

By itself however, a collection of axioms is not 
very useful, since it does not generate anything on 
its own. Only in conjunction with some logic, the 
rule of inference for example, do axioms lead to 
results. 

The rule of inference, or modus ponens , says: 

P^Q,P 

••Q 

In this and any mathematical expression, we use 
symbols to express ourselves. As with words in 
language and expression in conversation, we rely 
on ‘tools’ to convey the substance of our thoughts. 
Of course with words we can hnd inconsistencies: 

Ifit into my shirt. 

My shirtfits into my bag. 

Therefore Ifit into my bag. 

Though trivial, this shows that words have an un- 
derlying associated meaning. With this restriction 
in mind, we could fix the above by replacing ‘bag 
with ‘very large wardrobe’ 

Mathematics avoids such a restriction altogether; 
swapping P with Q in the modus ponens would 
still yield the same results. Obvious you might 
think, but philosophically this structural differ- 
ence is of great importance. 


Hilbert's Finitism 

Start with the statement: V x e Z, f(x) is true, 
where (f)(x) can be precisely one of ‘true’ or Talse’ 
If we negate this statement, would you imagine 
checking an infinite number of xs for falsity in 0 ? 
Or perhaps spot a suspicious looking x and prove 
him to be a counterexample? 

In the early 1920s, Hilbert was losing sleep on such 
matters. Or to be precise, he was concerned with 
making meaningful propositions and methods of 
reasoning which did not require the acceptance 
of infinite entities. This finitary viewpoint is par- 
ticularly important in the context of mathemati- 
cal operations, by only allowing arguments which 
can be translated into a finite set of propositions 
starting from a finite set of axioms. 

Of particular concern was the Quantifier Law of 
Excluded Middle (QLEM): 

Every x satisfies 0, or some x 
satisfies the negation of<p, 

where 0 is again a statement which is either true 
or false. 

Hilbert held a finitary view, meaning that if the 
domain being tested was infinite, the QLEM was 
not to be trusted. How could he know the value of 
9 (x) for any one of an infinite of x’s? More gener- 
ally, the finite belief prevented simultaneously al- 
lowing a property to be associated with inhnitely 
many objects. In our case, it means we cannot ap- 
ply an infinite conjunction to the integers: 

0(1) and 0(2) and 0(3) ... 
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If instead we are equipped with only hnite pro- 
cedures, then given a particular integer, we are 
able to prove the instance of (j)(n) for precisely that 
case. Hence negating this statement is not univer- 
sal. So not f(n) indicates that every instance of ‘n 
fails property </>’ is true, but it does not tell us that 
not every instance of f(n) is true. Put more simply, 
the following statements are not universally logi- 
cally equivalent: 

• Not every v satishes (f)(x); 

• Some x satishes not-0(x). 

Thus QLEM fails. More generally though, this 
shows that problems in our base assumptions 
need to be addressed to prevent ramihcations fur- 
ther on. 

Truth and its Limitations 

Also during the late 19th and 20th Centuries, 
many mathematicians began to question the lim- 
its on what kind of mathematical objects could 
be represented and manipulated. Aside from the 
popular Godels Theorem of Incompleteness, and 
Russells Paradox, a key if slightly less popular re- 
sult is Tarskis Undehnability Theorem. Published 
in 1936, this (very informally) states: 

Given ajormal arithmetic , a true arithmetical 

statement cannot be dejined in that arithmetic. 

To explain the original, technical form of this the- 
orem would take too long here [2], however we 
see from the above statement of the theorem that 
formal languages containing semantic terms like 
“true”, will always give a paradox when these terms 
are self-referenced. Tarski addressed this by mak- 
ing the distinction between semantically closed 
and semantically open languages. He dehned a 
semantically closed language to be one in which 
it is possible for a single sentence to predicate (de- 
termine) truth or falsehood in another sentence 
in the same language, or even of itself. Put simply, 
a semantically closed language can apply seman- 
tic properties to the terms that express semantic 
properties. 

This suggests that for a semantically open lan- 
guage to achieve consistency, we need to use a 
more powerful language, called a metalanguage, 
in order to be able to dehne a truth predicate. One 
of the most common uses of metalanguages is in 
computer science, such as the Backus-Naur Form 
developed in the 1960s, to describe the syntax of 
computer programming languages. 



So Have We Done Anything? 

Understanding what makes a statement true or 
false, and how our mathematics relates to our 
thoughts, has many times uncovered the limita- 
tions of underlying assumptions. Such question- 
ing often leads to fruitful ways of new thinking, a 
key example being the development of hyperbolic 
geometry in the 19th Century by the rejection of 
Euclids 2000 year old parallel line axiom. 

Similarly the work of Hilbert, Tarski and their 
contemporaries’ had large ramihcations in not 
only mathematics, but also in philosophy and se- 
mantics. Indeed the year after Hilbert published 
his foundation of classical mathematics, the phi- 
losopher Wittgenstein wrote extensively on the 
limitations of language, making the famous com- 
ment: 

The limits ofmy language , mean 
the limits ofmy world. 

If mathematics is our language, where do you 
think we are limited? 
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Discovery of the Higgs Boson 

4July2012 


CERN announced findings of a new par- 
ticle that was consistent with the pre- 
dicted Higgs boson. Evidence for this 
so-called'God Particle 7 has been sought 
after for many decades, and if this really 
was a Higgs, its existence would explain 
many mysteries of the universe, includ- 
ing how matter attains mass. 





21 / 12/2012 

According to widely ac- 
cepted arithmetic and astro- 
nomical theories, the world 
will... possibly ... probably 
not end. 





ABC Conjecture 

August2012 


Japanese mathematician Shinichi Mo- 
chizuki has published a proof online. It 
is still under verification. 

Let a, b, c be relatively prime integers with 
a + b = c. Then for any e>0, there is some 
C £ such that max(|a|, \b[ |c|) < C £ J] pjabc 
p 1+£ whenp is prime. 

Far reaching consequences include 
Roth'sTheorem, Fermat's LastTheorem 


JunkDNA 

September2012 

OK - not mathematics but a 
great discovery. While over 
98 % of the human genome 
had been previously thought 
to serve no purpose, and so 
called junk DNA, the ENCODE 
project released 30 papers 
disproving this to reveal that 
over 80% perform vital func- 
tions in the body. 



Type 1/49 in your calculator 
and you will be very surprised... 





















Hopes 
and Fears 

Paul Dirac 



First published in issue 32,1969 

A research worker who is actively follow- 
ing up some idea referring to the funda- 
mental problems of physics has, of course, 
great hopes that his idea will lead to an important 
discovery. But he also has great fears - fears that 
something will turn up that will knock his idea on 
the head and set him back to the starting point in 
his search for a direction of advance. Hopes are 
always accompanied by fears, and in scientihc 
research the fears are liable to become dominant. 

As a result of these emotions the research worker 
does not proceed with the detached and logical 
mind that one would expect from someone with 
scientihc training, but is subject to various re- 
straints and inhibitions which obstruct his path 
to success. He may delay taking some step liable 
to force a rapid show-down, and may prefer hrst 
to nibble at side-issues that provide a chance of 
achieving some minor successes and gaining a lit- 
tle strength before facing the crisis. 

For these reasons the innovator of a new idea is 
not always the best person to develop it. Some 
other person without the fears of the innovator 
can apply bolder methods and may make a more 
rapid advance. In the following there will be some 
examples that illustrate this situation. 

Anyone who has studies special relativity must 
have wondered why it was that Lorentz, after he 
had obtained correctly all the equations of the 
Lorentz transformation, did not then take the 
perfectly natural step of considering all frames 
of reference to be on the same footing and so ar- 
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Smallest number that can be written as the 
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riving at the relativity of space and time. History 
does not record just what it was that held Lorentz 
back, but it can only have been some kind of fear, 
perhaps a subconscious one. He did not dare to 
venture out into a domain of thought completely 
foreign to anything that anyone had ever imag- 
ined. He preferred to remain on the solid round 
of mathematical transformations, where his posi- 
tion was unassailable. It needed the boldness of a 
younger man such as Einstein to take the plunge 
into a new domain. 

The innovator of our present quantum mechanics 
was Heisenberg. At a time when atomic physicists 
were Aoundering about with the orbits of Bohr- 
Sommerfeld theory and feeling the need for a 
drastic alteration of basic principles, Heisenberg 
has the brilliant idea of constructing a new theory 
entirely in terms of observable quantities, quan- 
tities connected with observations on spectra. 
These are each connected with two atomic states, 
and the natural way of expressing them is in the 
form of matrices. Thus Heisenberg was led to con- 
sider matrices as dynamic variables. 

He had not proceeded far in developing this 
idea before he noticed that his dynamical vari- 
ables would not satisfy the commutative law of 
multiplication. This was most disturbing. It was 
inconceivable to a physicist in those days that 
dynamical variables could be any other than or- 
dinary algebraic quantities, and with the appear- 
ance of non-commutation Heisenberg had grave 
fears that his whole beautiful idea would have to 
be given up. 

When I read Heisenbergs hrst paper on the sub- 
ject, I had the advantage over him in not having 
his fears, as it was not my own idea that was at 
stake. I was therefore able to look at the question 
from a more detached point of view. 

I needed only a week or two to realize that the 
non-commutation which alarmed Heisenberg 
was really the dominating feature of the new 
theory. The idea of building up a theory entirely 
in terms of experimentally observed quantities, 
although a very pleasing philosophical doctrine, 
was of only secondary importance for the pur- 
pose of establishing a new dynamics. 

My early work on quantum mechanics was thus 
concentrated on the problem of bringing non- 
commutation into dynamical theory. It was not 
really very difkcult, because the previous atomic 
theory, the orbit theory of Bohr and Sommerfeld, 


was based on a form of dynamics, Hamiltons 
form, which turned out to be specially suitable for 
adapting to non-commutative algebra. 

Heisenberg continued to develop his theory, in 
collaboration with other people in Gottingen. I 
worked independently from them, apart from 
getting the initial idea from Heisenberg. We pub- 
lished papers at about the same time, setting the 
foundations for quantum mechanics. Our styles 
were different on account of the different points 
of view we held, mine being based on non-com- 
mutation and Heisenbergs on the use of matrices 
built up from observable quantities. 

Quantum mechanics was discovered quite inde- 
pendently by Schrodinger, working on entirely 
different lines. He had his own difhculties. He 
was thinking over the mathematical connection 
between waves and particles that had been dis- 
covered some time previously by de Broglie, and 
eventually found a way of generalizing it to apply 
to an electron moving in an electromagnetic held. 
He then had a very beautiful wave equation, con- 
forming to relativity. He proceeded to apply it to 
the hydrogen atom and his worst fears were real- 
ized. The results did not agree with observation. 

We know now that the discrepancy was due to 
the spin of the electron, which was unknown to 
Schrodinger at the time, although the experimen- 
talists had begun to suspect it. It was a most de- 
pressing situation for Schrodinger, and led him to 
abandon the work for some months, and eventu- 
ally to publish it only in the non-relativistic ap- 
proximation, in which the discrepancy does not 
show up. The relativistic equation was later re- 
discovered by Klein and Gordon, who were not 
afraid to publish an equation in disagreement 
with observation, while Schrodinger was. So the 
equation now bears their name. It has some value 
in describing spinless mesons. 

Schrodingers quantum mechanics was soon 
found to be equivalent to that originated by 
Heisenberg, in spite of their hrst seeming so dif- 
ferent. The basic equations of the new mechanics 
were securely established, and it became neces- 
sary to hnd a physical interpretation for them. 
With non-commutative algebra it could not be as 
direct as in the classical theory. The general physi- 
cal interpretation was found to be only a statistical 
one. One could calculate probabilities, but could 
not usually predict an event with certainty. 


One of few to be both a pentagonal 
number and a centred pentagonal number. 
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A difficulty now appeared in connection with the 
relativistic equation of Klein and Gordon. The 
theory sometimes gave negative probabilities. It 
was a satisfactory theory only when it was used 
non-relativistically. I puzzled over this for some 
time and eventually thought of a new wave equa- 
tion which avoided the negative probabilities. I 
found that it also gave automatically the spin of 
the electron, a most gratifying result. I proceeded 
to apply the new equation to the hydrogen atom, 
taking into account the relativistic corrections 
only to the hrst order of accuracy to simplify the 
calculations, and found agreement with observa- 
tion. 


The natural thing to do at this stage would have 
been to continue to higher orders of accuracy, but 
I did not do so. I was scared that they might not 
agree with observation. I hastily wrote up a paper 
with merely the hrst order of accuracy and pub- 
lished that. In doing so I felt I was consolidating a 
limited success, and even if the higher orders did 
go wrong there would still be something to stand 
on. It was left to Darwin, who did not share my 
fears to carry out the calculation to all orders of 
accuracy and see that the results were alright. 


In my hrst paper on the subject (Proc. Roy. Soc A 
117, page 610) there occurs the equation 

/w e \ 2 ( e \ 2 

F=(- + -A 0 ) + { P+ -a) 


2 2 

+ m c . 


The relativist, if he sees this equation nowadays, 
will say at once: there is a mistake here. The plus 
signs before the second and third terms on the 
right should be minuss. He will wonder how such 
a conspicuous mistake could have remained un- 
detected in the proof-reading. He will wonder still 
more when he sees the same mistake perpetuated 
in later equations. 

The explanation is that there is really no mistake 
and things were published as the author intended. 
The plus signs were the expression of a fear. At 
that time relativity was still unfamiliar and peo- 
ple had continually to cling to the symmetry of 
space and time so as not to let it out of their heads. 
The symmetry becomes perfect only if one uses a 
time variable which is v-l times the usual time 
and makes a corresponding change in all 4-vec- 


tors. With this notation there are no mistakes in 
the paper. This notation was frequently used in 
those days, and it was not considered necessary 
to explain it every time it was used, because the 
context made it clear. The arrival of the new wave 
equation rather forced one to give it up, as it then 
became too clumsy. 

The new wave equation led to a difficulty in that 
it allowed states of negative energy for the elec- 
tron. Negative energies are never observed, but 
they could not be ignored in the theory. I thought 
of a way of coping with them, namely, to assume 
that in the physical world all or nearly all of the 
negative-energy states are occupied, so that ordi- 
nary positive-energy electrons cannot jump into 
them. An unoccupied negative-energy state is a 
hole which appears as a particle with a positive 
energy and a positive charge. 

Right from the beginning I had the feeling that 
there would be symmetry between the holes and 
the electrons. This feeling was strengthened by the 
knowledge that in the chemical theory of the va- 
lency of atoms, there is a considerable amount of 
symmetry between an electron lying outside the 
closed shells and a hole in a closed shell. I did not 
want the symmetry. At that time it was believed 
that all positive charges were in protons, and 
the proton was much heavier than the electron. 
So I struggled with the hope that in some way 
the Coulomb interaction between the electrons 
would lead to a dissymmetry between the holes 
and the electrons, and was afraid that if this hope 
should fail the whole idea would have to be aban- 
doned. It was left to others, in particular Weyl and 
Oppenheimer, to make the bold assertion that 
mathematical symmetry demanded that the holes 
should have the same mass as the electrons. 

With these developments the theory of single par- 
ticles was put into order. There remained prob- 
lems concerned with interaction. If one sets up 
precise relativistic equations one hnds that the 
interaction is so violent that the equations do not 
have any solutions. The difficulties are still not 
satisfactorily resolved and point to the need for 
some further drastic change in the foundations of 
atomic theory. 
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Number of paying cards in one deck 
and number of white keys on a piano. 


Smallest prime number that does not 
divide the order of any sporadic group. 
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Quantum Gravity 

Stephen Hawking, DAMTP Cambridge 


First published in issue 32,1969 

T he interactions that one observes in the 
physical universe are normally divided 
into four categories according to their bo- 
tanical characteristics. In order of strength they 
are, the strong nuclear forces, electromagnetism, 
the weak nuclear forces and, the weakest by far, 
gravity. The strong and weak forces act only over 
distances of the order of 10“ 13 cm or less and so 
they were not discovered until this Century when 
people started to probe the structure of the nu- 
cleus. On the other hand electromagnetism and 
gravity are long range forces and can be readily 
observed. They can be formulated as classical, i.e. 
non quantum, theories. Gravity was hrst with the 
Newtonian theory followed by Maxwells equa- 
tions for electromagnetism in the 19th Century. 
However the two theories turned to be incom- 
patible because Newtonian gravity was invariant 
under the Galilean group of transformations of 
inertial frames whereas Maxwell equations were 
invariant under the Lorentz group. The famous 
experiment of Michelson and Morley, which 
failed to detect any motion of the Earth through 
the luminiferous aether that would have been re- 
quired to maintain Galilean invariance, showed 
that physics was indeed invariant under the Lor- 
entz group, at least, locally. It was therefore neces- 
sary to formulate a theory of gravity which had 
such an invariance. This was achieved by Einstein 
in 1915 with the General Theory of Relativity. 

General Relativity has been very successful both 
in terms of accurate verification in the solar sys- 
tem and in predicting new phenomena such as 


black holes and the microwave background radia- 
tion. However, like classical electrodynamics, it 
has predicted its own downfall. The trouble arises 
because gravity is always attractive and because it 
is universal i.e. it affects everything including light. 
One can therefore have a situation in which there 
is such a concentration of matter or energy in a 
certain region of space-time that the gravitational 
field is so strong that light cannot escape but is 
dragged back. According to relativity, nothing can 
travel faster than light, so if light is dragged back, 
all the matter must be confined to a region which 
is steadily shrinking with time. After a finite time 
a singularity of infinite density will occur. 

General Relativity predicts that there should be a 
singularity in the past about 10,000 million years 
ago. This is taken to be the “Big-Bang”, the begin- 
ning of the expansion of the Universe. The theory 
also predicts singularities in the gravitational col- 
lapse of stars and galactic nuclei to form black 
holes. At a singularity General Relativity would 
lose its predictive power: there are no equations to 
govern what goes into or comes out of a singular- 
ity. However when a theory predicts that a physi- 
cal quantity should become infinite, it is generally 
an indication that the theory has broken down 
and has ceased to provide an accurate description 
of nature. A similar problem arose at the begin- 
ning of the Century with the model of the atom as 
a number of negatively charged electrons orbiting 
around a positively charged nucleus. According 
to classical electrodynamics, the electrons would 
emit electromagnetic radiation and would lose 
energy and spiral into the nucleus, producing a 
collapse of the atom. The difficulty was overcome 


Sum of the first 10 integers and lOth Fibonacci number. 
Largest number that is both triangular and Fibonacci. 
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by treating the electromagnetic held and the mo- 
tion of the electron quantum mechanically. One 
might therefore hope that quantisation of the 
gravitational held would resolve the problem of 
gravitational collapse. Such a quantisation seems 
necessary anyway for consistency because all oth- 
er physical helds appear to be quantised. 

So far we have had only partial success in this 
endeavour but there are some interesting results. 
One of these concerns black holes. According to 
the Classical Theory the singularity that is pre- 
dicted in the gravitational collapse will occur in 
a region of space-time, called a black hole, from 
which no light or anything else can escape to 
the outside world. The boundary of a black hole 
is called the event horizon and acts as a sort of 
one way membrane, letting things fall into the 
black hole but preventing anything from escap- 
ing. However, when quantum mechanics is taken 
into the account, it turns out that radiation can 
“tunnel” through the event horizon and escape 
to inhnity at a steady rate. The emitted radia- 
tion has a thermal spectrum with a temperature 
inversely proportional to the mass of the black 
hole. As the black hole emits radiation, it will loss 
mass. This will make it get hotter and emit more 
rapidly. Eventually it seems likely that the black 
hole will disappear completely in a tremendous 
hnal explosion. However the time scale for this to 
happen is much longer than the present age of the 
Universe, at least for black holes of stellar mass, 
though there might also be a population of much 
smaller primordial black holes which might have 
been formed by the collapse of irregularities in 
the early Universe. 

One might expect that vacuum Auctuations of 
the gravitational held would cause “virtual” black 
holes to appear and disappear. Particles, such as 
baryons, might fall into these holes and be radi- 


ated as other species of particles. This would give 
the proton a hnite lifetime. However it is dithcult 
to discuss such processes because the standard 
perturbation techniques, which have been suc- 
cessful in quantum electrodynamics and Yang- 
Mills theory do not work for gravity. In the former 
theories one expands the amplitudes in a power 
series in the coupling constant. The terms in the 
power series are represented by Feynmann dia- 
grams. In general these diverge but in these theo- 
ries all the inhnities can be absorbed in a redeh- 
nition or “renormalisation” of a hnite number of 
parameters such as coupling constants as masses. 
However in the case of gravity, the inhnities of dif- 
ferent diagrams are different and so they would 
require an inhnite number of renormalisation 
parameters whose values could not be predicted 
by the theory. In fact the situation is not really 
that much worse than with the so-called renor- 
malisable theories since even with them the per- 
turbation series is only asymptotic and does not 
converge, leaving the possibility of adding an ar- 
bitrary number of exponentially vanishing terms 
with undetermined coethcients. 

The problem seems to arise from an uncritical 
application of perturbation theory. In classical 
general relativity it has been found that pertur- 
bation expansions around solutions of the held 
equations have only a very limited range of valid- 
ity. One cannot represent a black hole as a per- 
turbation of flat space-time yet this is what sum- 
ming Feynmann diagrams attempts to do. What 
one needs is some approximation technique that 
will take into account the fact that the gravita- 
tional field and the space-time manifold can have 
many diAerent structures and topologies. Such a 
technique has not yet been developed but we, at 
Cambridge, have been approaching the problem 
by studying the path integral approach formula- 
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tion of quantum gravity. In this the amplitudes 
are represented by an integral over all metrics 

j D[g\ exp(-%]) 

where D[g] is some measure on the space of all 
metrics g and I [g] is the action of the metric g. 

If the integral is taken over real physical metrics 
(that is, metrics of Lorentzian signature - + + +), 
the action I is real so the integral oscillates and 
does not converge. To improve the eigenvalues 
one does a rotation of 90° in the complex t-plane. 
This makes the metric positive dehnite (signature 
+ + + + and the action I pure imaginary so that the 
integral is of the form 

/ D[g\ exp(-I[g]) 

where I = -il. The Euclidean action I has certain 
positive dehnite properties. 

One is thus led to the study of positive dehnite 
metrics (particularly solutions of the Einstein 
equations) on four-dimensional manifolds. If the 
manifolds are simply connected, their topology 
can be classihed (at least up to homotopy) by two 
invariants, the Euler number as measuring the 
number of holes or gravitational instantons and 
the signature measures the difference between 
right-handed instantons and left-handed ones. 
It seems that the dominant contribution to the 
path integral comes from metrics with about one 
instanton per Planck volume 10“ 142 cm 3 s. Thus 
space-time seems to be very highly curved and 
complicated on the scale of the Planck length 
10- 33 cm, even though it seems nearly flat on larg- 
er scales. 

However we still do not have a proper scheme for 
evaluating the path integral. The difhculty lies in 
dehning a measure D[g] on the space of all met- 


rics. In order to obtain a hnite answer it seems 
necessary to make inhnite subtractions and these 
leave hnite undetermined remainders. There is a 
possible way of overcoming this dihiculty which 
may come from an extension of General Relativ- 
ity called supergravity. In this the spin 2 graviton 
is related to a spin 3/2 held and possibly helds 
of lower spin by anticommuting “supersymme- 
try” transformations. In these theories there is an 
equal number of bosons (integer spin particles) 
and fermions (half integer spin particles). The 
inhnities that arise in the path integral from the 
integration over boson helds seem to cancel when 
the inhnities that arise from the integration over 
the fermion helds, raising the hope that one could 
provide a proper mathematical dehnition of the 
path integral, maybe some limiting process. 

Supergravity theories have another very desirable 
feature, they may unify gravity with the other in- 
teractions and particles in physics. In 1967 Salem 
and Weinberg proposed a unihed theory of the 
electromagnetic and weak interactions. This has 
had considerable success in predicting experi- 
mental results though the hnal conhrmation will 
have to wait for the next generation of particle ac- 
celerators. Nevertheless, it has given great stimu- 
lus to attempts to unify the strong, the weak and 
the electromagnetic interactions into a “Grand 
Unihed Theory”. A feature of such theories is that 
the complete unihcation is seen only at the very 
high energies of the order of 10 19 Gev, at which 
quantum gravitational effects should become im- 
portant. It may well be therefore that one will be 
able to achieve the unihcation only by incorporat- 
ing gravity as well in a completely unihed theory 
which would describe all of physics. This was 
the goal to which Einstein devoted the last thirty 
years of his life, without much success. The pros- 
pects look brighter now though it is still probably 
quite a long way off. 


The 57-cell is a self-dual abstract regular four- 
dimensional polytope. It has symmetry group Z_ 2 (19). 
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T here has been a recent flurry of articles and 
books proposing that we live in a multiverse 
rather than a universe: there is not one uni- 
verse but many (Rees 1999, Rees 2000, Carr 2008, 
Greene 2011). This raises key issues about the 
validity and utility of mathematical models, and 
their relation to what exists. Mathematical mod- 
els must be coherent as models, based in what- 
ever the underlying physics is; but that is not by 
itself suthcient to make them physically relevant. 
If they are meaningful as mathematical models of 
some physical system, they must be applicable to 
that context; which means you need to be able to 
test them and see if they describe the system well. 
If there is no possible way to test them, you have 
a problem: it is unclear whether they are indeed 
reliable models of reality. And that is a major issue 
that arises as regards multiverse theories . 

A variety of kinds of multiverses have been en- 
visaged by many authors. In his recent book The 
Hidden Reality (Greene 2011), Brian Greene pro- 
poses nine different types of multiverse theories: 

1. Existence beyond the horizon: Invisible 
parts of our own universe. 

2. Chaotic inflation, leading to different ex- 
panding domains in separate places. 

3. Brane worlds of M-theory (Four-dimension- 
al space-times embedded in higher dimen- 
sional spacetimes). 

4. Cyclic universes, leading to different ex- 
panding domains at different times. 

5. The Landscape of string theory embedded in 
a chaotic cosmology. 


Number of stellations of an icosahedron. 
Number of orthorhombic space groups. 


59 




Proper Distance (Gly) 

▲ Figure 1 SpaceTime Diagram - Normal Distance and Time 


6. The Everett quantum multi-universe: other 
branches of the wave function. 

7. Holographic projections (currently a trendy 
proposal in cosmology). 

8. The universe is a computer simulation. 

9. All that can exist must exist—the “grandest 
of all multiverses”, the separate universes be- 
ing totally disjoint from each other. 

Now one thing is clear - they cant all be true, for 
they contlict with each other. There remains the 
hnal possibility: 

10. Maybe none of them is true - there is just 
one universe. 

I will concentrate on the most popular one: cha- 
otic inhation (2), usually coupled with the land- 
scape of string theory (5). I will show hrstly that 
there is no way to directly verify that this model 
is true, and secondly that it is not based in well 
understood and verified physics. Hence while it 
may possibly be true, it has not been proved so, 
and indeed that proof may well be impossible. 
The reason we cant prove a multiverse exists ob- 
servationally is due to the nature of its spacetime 
structure, which on a large scale is governed by 


Einsteins General Relativity Theory. When we 
model the large scale structure of the universe, 
our cosmological models are surprisingly simple: 
they assume a basic structure that is both spatial- 
ly homogeneous (all physical quantities are the 
same everywhere at the same cosmic time) and 
spatially isotropic (there are no preferred direc- 
tions in the sky when we average matter on large 
enough scales). This geometry is represented by 
the metric of the spacetime (see Appendix), which 
has a scale factor a(t) representing the change of 
distance between galaxies with time, whose time 
evolution is determined by the Einsteins gravita- 
tional field equations, depending on the matter 
and radiation content of the universe. The met- 
ric also determines the paths of photons through 
spacetime, and so in particular determines the 
size of the visual horizon as a function of cosmic 
time. 

To understand this properly one of course needs 
to contemplate the equations of the theory, given 
in the Appendix. However we can also under- 
stand its relevant properties straightforwardly 
from spacetime diagrams showing how causal 
relations work in these models. The way such 
diagrams work, and their relation to the under- 
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Comoving Distance (Gly) 


▲ Figure2 Space Time Diagram - Comoving Distance and Contormal Time 


lying spacetime metric, is discussed in detail in 
my book with Ruth Williams (Ellis and Williams 
1995). The relevant details are as follows. 

Figure 1 is a space time diagram representing spa- 
tial distances and cosmic time correctly. Time is 
plotted vertically, and distance horizontally. The 
start of the universe is at t- 0; galaxy world lines 
diverge from each other since then. Our galaxy 
world lie is at the centre (r = 0); the present is 
labelled “here and now”. Our past light cone is 
marked in red; this is the path through spacetime 
of photons that are reaching us now. Going back 
into the past, it reaches a maximum radius and 
then contracts back to the big bang singularity; 
this is basically because gravity bends light - one 
of Albert Einsteins major discoveries. 

Now the problem with that diagram is we cant 
see causal relations near the big bang very well. 
We can correct that by changing to stretched dis- 
tance and time coordinates, that transform the 
past light cones to lines at ± 45° and matter world 
lines to vertical lines (Figure 2). This is allowed 
because Einsteins theory allows the use of any co- 
ordinates whatever to represent a given spacetime 
(this is the principle of general covariance). The 


initial singularity - a point in Figure 1 - is then 
represented by the horizontal line at the bottom. 
One should note that this singularity is not part of 
spacetime - it is the boundary of spacetime. This 
diagram also shows something not represented 
in Figure 1: the dotted horizontal line just above 
the singularity. This represents the surface of last 
scattering (“LSS”), where matter and radiation 
decoupled from each other in the early universe. 
This is the furthest back that we can see, because 
the universe was opaque to radiation at earlier 
times. Hence any earlier physics - the way the 
universe was created, the subsequent inhationary 
era - is not visible to us (this is similar to the way 
the surface of the Sun hides its interior from us). 
Most importantly, whatever that earlier physics 
was does not affect light propagation since decou- 
pling of matter and radiation at the LSS: hence the 
causal limits on what we can see are unaffected by 
any such earlier physics. 

Now the key point is that there is a furthest set of 
matter we can see by any electromagnetic radia- 
tion whatever; its world lines are marked as the 
Visual Horizon on the right. It is the world lines 
of matter that pass through the intersection of our 
past light cone with the LSS. This matter emit- 


Three times Fortunate number. Maybe the largest prime 
dividing the product of the next two primes plus 1. 
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ted the cosmic blackbody background radiation 
detected by the WMAP satellite, so the famous 
microwave background anisotropy map (Figure 
3) is just the image we get of this matter at the LSS. 
Any further out matter cannot be seen or detected 
by us by means of any radiation whatever (assum- 
ing no radiation moves faster than light - a key 
feature of relativity theory). The causal horizon 
(marked here as “present day horizon”) lies fur- 
ther out, and depends on early physics. 

In order to get a better picture of our observa- 
tional limits, we need to step back a bit and see 
the bigger context for Figure 2, depicted in Figure 
4: the whole of Figure 2 is the left hand triangle 
there. We cannot detect matter outside there by 
any means whatever. Hence we have no means 
of telling what conditions are like inside the pre- 
sumed universe domain on the right (the same 
size as our entire visible universe domain on the 
left). Physics there might be the same as here, or it 
might be totally different. There is simply no way 
we can ever hnd out. 

So here is the basic problem for multiverse propo- 
nents: no observational data whatever are avail- 
able to verify their claims of distant universe do- 
mains out there with different physics than in our 
domain. If the basis of science is verifying theo- 
ries by observation, then multiverse theories are 
not science. The assumption made in those theo- 
ries is that we that can extrapolate to 100 Hubble 
radii, 101 000 Hubble radii, or much much more 
(the word Tnhnity’ is casually bandied about) to 
determine in broad terms what conditions are 
like there. That s not testable science. 

But there is another line of argument. Maybe one 
can justify the multiverse assumption if is a neces- 
sary outcome of known and tested physics, even if 
one cannot directly verify its existence. This is in- 
deed a sound line of reasoning. The problem with 
it is that several aspects of the physics supposed 
to lie the multiverse are hypothetical rather than 
well established: they are major extrapolations of 
known physics into the unknown, and those ex- 
trapolations may or may not be true. This issue is 
discussed in depth by Banks (2012), who shows 
quite clearly that none of the supposed physics 
(Coleman-de Luccia tunneling, the landscape of 
string theory, the supposed connection between 
chaotic inhation and string theory vacua) is well 
established physics. 

A third line of argument is that existence of a 
multiverse explains anthropic coincidences in 

4 Figure3 The Cosmic Background Radiation 
sky - our image of the Last Scattering Surtace. 






▲ Figure 4 The entire visible universe is a tiny traction of the claimed multiverse. 
Most of its regions (if they exist) are not observationally accessibleto us by any means. 


cosmology: why the universe is a suitable place 
for life to exist, in particular explaining the value 
of the cosmological constant (the “dark energy” 
currently causing the universe to accelerate). This 
case is made for example by Martin Rees (1999, 
2001), see also Carr (2009). Now it does indeed 
provide such an explanation. Does this therefore 
justify belief in a multiverse? Yes if you think 
theory trumps observational testing in a scientihc 
theory; no otherwise. Key philosophical issues 
about the nature of scientihc theories underlie 
this choice; a discussion is given in Ellis (2006). 

There are however two exceptions to this gloomy 
picture re testability of the multiverse idea. The 
hrst is the possible existence of “small universes”: 
universes where the spatial sections are spatially 
closed on a scale smaller than that of the visual 
horizon (Lachieze-Ray and Luminet 1995). In 
that case the horizontal axis of Ligure 2 would 
close on itself on a scale smaller than that of the 
visual horizon, and we would already have seen all 
the matter there is in the universe, thus disproving 
the multiverse hypothesis. This intriguing possi- 
bility can be tested in various ways, in particular 
by searching for identical circles of temperature 
fluctuations in the CMB sky. This search has so 
far proved unsuccessful: this remains a possibility, 
but is perhaps unlikely. 

The second exception would be if there were colli- 
sions between different bubbles in the multiverse, 
resulting in detectable disk-like patterns in the CB 
sky. If such bubble collisions were detected and 
additionally could be associated with a variation 
of physics in the different bubbles, for example 
different values of the fine structure constant, this 
could legitimately be taken as vindication of the 
physics supposed to underlie the multiverse pro- 
posal. This has so far not been observed. 

A hnal comment relates to the issue of inhnities. 
It is often said that inhnities of universes occur in 
the multiverse (see for example Vilenkin 2007). 
This is a very dubious claim. Lirstly, David Hilbert 
has stated “the inhnite is nowhere to be found in 


reality, no matter what experiences, observations, 
and knowledge are appealed to.” (Hilbert 1964). I 
strongly concur. Secondly, in any case such claims 
are not verifiable, for there is no possibility what- 
ever of verifying them (no matter how many en- 
tities you have counted, you have not proved an 
inhnity exists). If science is to do with testable 
claims, then any such claims are not science. 

Lor other motivations for the multiverse, argu- 
ments in its favor, and counterarguments, see my 
Scientihc American article (Ellis 2011), the book 
edited by Carr (2009), and Kragh (2012). 
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Primes and Particles 

Jack Williams, Clare 


T he long-studied and incredibly elusive Rie- 
mann zeta-function has bathed number 
theorists for centuries. Deeply connected to 
the prime numbers and their distribution among 
the integers, its importance in number theory is 
well known. The Riemann hypothesis is perhaps 
the most famous unsolved problem in mathemat- 
ics, not least because the Clay Mathematics Insti- 
tute has offered $1 million for a solution. 

However, interest in this mysterious function 
extends far beyond esoteric results in number 
theory. With implications for physics, probability 
and statistics, it is more than just a number theo- 
retic curiosity. The underlying distribution of the 
zeros along the critical line 91 (s) =2 penetrates 
many branches of mathematics and there has 
been growing interest in the obscure connections 
it reveals between these helds. 


When dehned as 


«*) = £ i 


for 91 (s) > 1, it seems unconnected to the primes. 
But Euler hrst noticed the factorisation 
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which holds by the fundamental theorem of arith- 
metic. Summing the geometric series yields 


C(i) 


n 

p prime 


1 


1 ~p- s 


This factorisation, which can be made rigorous, 
reveals a deep connection with the primes. 

As an illustration of the zeta-functions role in 
number theory, we give simple proof of the famil- 
iar Euclidean theorem. 


Theorem There are inhnitely many primes. 


Proof: Suppose not. Then taking s 
identity, 
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gives 
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p prime 
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which must be hnite because there are only hnite- 
ly many terms in the right hand side. But since the 
harmonic series diverges, this is a contradiction. 

□ 


Scattered recklessly throughout the integers, the 
primes have been studied for thousands of years. 
Gauss prime number theorem states that 


n{n) 


n 

ln (n) 


where n(n) is the number of primes less than or 
equal to n. This means that the probability that is 
prime is roughly for large n. Here n need not 
be very large for a good approximation to n(n). 
For n = 100, it estimates about 21 primes when the 
correct value is 25. 


Smallest number that becomes a square when its 
reverse is either added to or subtracted from it. 
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4 Probability density 
tunction for the normalised 
spacing between consecu- 
tive zeroes of the Riemann 
zeta-function 


In 1859, Riemann published another linkbetween 
the zeta-function and the primes, giving an exact 
formula for n(n) , which improved on the prime 
number theorem. Riemanns formula uses the 
location of zeroes in the analytic continuation 
of the Riemann zeta-function. Number theorists 
are interested in it because the distribution of ze- 
roes gives information about the distribution of 
primes. 

While prime numbers are sparser among the larg- 
er integers, the Riemann zeroes become denser 
further from the real line. It is possible to show 
that 

V 7 2 7T 

where n(T) = \{y e K.: ( Q + yi) = 0,0 < y < T}|. 
However, these zeroes are not arranged regularly 
along the critical line and the hner detail has been 
studied in more depth. 

One way to get information about this distribu- 
tion is to study a spacing distribution, the dis- 
tribution of the distances between consecutive 
zeroes. These distances can be normalised to ac- 
count for the global’ distribution given by n(T). 
The correct normalisation is yj - which 

results in the following distribution, estimated 
numerically using zeroes numbered 10 21 + 1 to 
10 21 + 10000. 

Surprisingly, this distribution arises naturally in 
other areas of mathematics and physics and seems 
to be fundamental to many seemingly unrelated 
problems. Experiments have been performed in 
which neutrons are scattered off a heavy nucleus. 
The resulting cross-section contains peaks, the 
scattering resonances, and troughs. If a neutrons 
energy is near to one of the peaks, it is repelled by 
the nucleus and if it is near a trough, it can pass 


through effortlessly. This strange behaviour leads 
to different scatterings. Although these problems 
are too dihicult to solve analytically or numeri- 
cally, empirical data can be obtained and a similar 
analysis to the separation of the zeroes can be per- 
formed on these peaks. Astonishingly, the proba- 
bility density matches that of the Riemann zeroes. 
In fact, the density does not depend greatly on the 
particular nucleus being used. This remarkable 
connection is still poorly understood. 

The mathematical structure which models the 
scattering resonances is also quite unexpected. In 
the 1950s, Eugene Wigner proposed a statistical 
model based on random matrices. Inspired by 
Heisenbergs formulation of quantum mechanics, 
in which properties of an atom or a nucleus can be 
described by a Hermitian matrix, he put forward 
the random-matrix conjecture. He suggested that 
these peaks follow the same spacing as eigenval- 
ues of Hermitian matrices whose elements are 
chosen from some probability distribution, usu- 
ally normal. The eigenvalues of the matrix are cor- 
respond to energy levels of the spectrum. Using 
such random matrices, one can obtain spacings 
that are statistically similar to both the neutron 
scattering data and the distribution of zeroes of 
the Riemann zeta-function. 

Although it is possible that the zeroes play some 
direct role in physical systems, the unexpected 
connection between the primes, matrices and 
neutron scattering may reflect a deeper result, 
uniting many seemingly disjoint areas of math- 
ematics. There has been progress in this direc- 
tion. Theorems have been proved showing that 
there is a single limiting distribution for the ei- 
genvalue spectrum arising from a large set of ran- 
dom matrices. Similar in spirit to the central limit 
theorem, these results mirror the type universality 
intrinsic to results which connect many areas of 
mathematics. 
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Largest number below 2000 not 
containing an 'e' (an eban number). 


Irregular and lucky and Pillai prime. Maximum 
number of letters in website domains. 
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M-Theory, Duality and Art 

Dr David Berman, Queen Mary University London 


T his will not be an article about mathematics 
or theoretical physics but more about why 
we carry out such activities and the com- 
mon ground mathematicians share with those 
in the arts. When we look at the long history of 
mathematics there is a deep aesthetic sensibility 
present in the writings of mathematicians. Com- 
mon references to beauty, symmetry emerge as 
almost a guiding tool. After all, mathematics is 
not just about what is true but what is interesting. 
What is interesting and beautiful seems somewhat 
subjective and yet when faced with the profound 
beauty of any number of mathematical theorems 
(the reader should think of her favourite theo- 
rem at this point) then it just seems so blindingly 
obvious that no explanation is ever needed. I am 
a theoretical physicist and I remember still the 
spine tingling excitement at hrst seeing Noether s 
theorem. This link between the symmetries of the 
laws of nature and conserved quantities is to me 


an example of the irresistible beauty that math- 
ematics can uncover in nature. 

A typical example of the aesthetic raptures of 
mathematicians is found in Poincares famous 
quote, “The scientist does not study nature be- 
cause it is useful. He studies it because he delights 
in it and delights in it because it is beautiful.. .1 do 
not talk here of the type of beauty that strikes the 
senses but a profound beauty that comes from the 
harmonious order of the parts.” 

Poincares “harmonious order of the parts” is 
somehow the mathematical beauty we know and 
love. His “beauty that strikes the senses” is I sup- 
pose a reference to more common ideas of beauty 
present in the visual arts. And yet are the two so 
necessarily different or can one rehect the other 
though perhaps with the inevitable loss of an im- 
perfect rehection. 


In a normal distribution, 68% of values are 
within one standard deviation of the mean. 




When we look at some of the goals and ideas of 
artists there is often a theme uncovering different 
“ways of seeing” and of challenging our everyday 
view of the world. (This is a quick and perhaps 
over easy generalisation but let me continue). In 
the 20th century, the artist followed many of the 
revolutions of 20th century physics with a remov- 
al of direct representation and a strong foray into 
abstraction. In Mondrians seminal essay, “Natu- 
ral reality and abstract reality”, the artist puts 
forward a manifesto justifying the abstraction of 
universals from nature, concentrating on essential 
building blocks and looking at structural connec- 
tions while ignoring the detail present in the in- 
dividual objects to bring out the common, shared 
structures in nature. His essay is comfortable 
reading for a contemporary theoretical physicist. 

And so in goals and in a very human belief in aes- 
thetics perhaps the differences are not so great af- 
ter all. The language of mathematics is where bar- 
rier to communication lies. Its enormous power 
prevents the concepts from being accessible or 
visceral. The challenge then is to capture some- 
thing of an idea or concept in physical work. 

Duality is a key idea in mathematics and is now 
at the centre of the frontier in theoretical phys- 
ics, string and M-theory. (One can argue whether 
duality really means the same thing in the context 
of mathematics and in theoretical physics but we 
will pass over this). 


4 "Generalised Geometry 1&2", Berman and Davey 
▲ "125 GeV" in wood, Berman and Davey 

In string theory, there is a duality symmetry 
known as T-duality. This is a fundamental ambi- 
guity in the description of the space time back- 
ground in which the string lives. If the space time 
has some specihc properties (technically, it should 
possess an isometry and be compact so that its 
hrst homotopy class is nontrivial) then there will 
be two backgrounds that will be related to each 
other that in ordinary differential geometry will 
be inequivalent and yet will be indistinguishable 
from the point of view of the string. These pairs 
are known as T-duals. This duality is stringy in 
nature and leads on to the idea of stringy geome- 
try that differs from our usual notion of geometry 
in that such ideas of T-duality get built in. 

The Turner prize winning artist Grenville Davey, 
has spent many years working on sculptures that 
should be seen in isolation but as objects that 
bring out relationships and symmetries. They are 
collections of objects which when brought togeth- 
er show a relation. 

Together Grenville and I have been exploring 
ways to have a sculptural manifestation of some of 
the ideas in theoretical physics such as T-duality 
and spontaneous symmetry breaking. The goal is 
not to explain or exemplify but simply to inspire. 


Its square and its cube together contain every digit 
exactly once. Largest factorial on many calculators. 
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Computer representations of T-dual manifolds 
though perhaps accurate are not what we were af- 
ter. Instead the works were to be inhuenced by the 
mathematics rather than represent the mathemat- 
ics in some faithful way. This allowed the process 
to be free and in the end driven by the detailed 
aesthetics of the pieces themselves than by faith- 
fulness to an idea. The result has been a series of 
sculptures that have been shown in various gal- 
lery spaces but also in the Isaac Newton Institute 
for mathematical sciences. 


▲ "125 Gev" Berman and Daveyy 
▼ "Generalised Geometry" Berman and Davey 

Who knows whether there is anything of T-dual- 
ity or symmetry breaking in these works. What 
is interesting is the process by which the pieces 
came about and the fact that some very abstract 
mathematics has given rise to some pieces of 
sculpture and inhuenced the mind of a leading 
British sculptor. 
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One of only two to be both Pell number 
and a generalized heptagonal number. 


Algebraic degree of Conway's constant. Largest 
prime factor of the order of a sporadic simple group. 
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C limate change is now widely expected to 
cause signihcant changes to conditions on 
Earth in the next century, with our actions 
playing a key role in determining what happens 
next. One of the less well understood effects is sea 
level rise. This will likely be dominated by glacier 
retreat in Antarctica and Greenland. 

It is these glaciers in particular that we attempted 
to understand this summer. Our method was to 
use a laboratory model for ice, which captures a 
hitherto neglected but we believe critical aspect: 
changes in viscosity. With a high rate of shear 
(velocity gradient), there is increased melting be- 
tween adjacent crystals. This leads to a reduction 
in viscosity. 

Our laboratory model for ice was Xanthan, a 
shear-thinning biological polymer. We started 
by considering the ice shelf, believing (or hoping 
for) the sheet to have been solved for already, as 
a viscous gravity current. The situation without 


sidewalls was simple enough for us to solve it 
analytically without experiments to guide us. We 
found that the front goes as a power law in time 
(it accelerates). The other key result was that, for 
a fixed source thickness and fixed entry flux, the 
thickness at any location does not change (once 
the front reaches it). However, the source thick- 
ness remained a mystery. The width is still un- 
known (though it increases if the flux is higher). 

Happy with this very early (partial) breakthrough 
(the first real one in our careers), we then at- 
tempted to understand the effect of sidewalls. The 
motivation was ice shelves inside canyons, which 
is not at all unusual. Also common is slowly flow- 
ing pack ice completely filling a bay, leading to 
some friction at the edges. A typical experiment 
is shown in operating configuration, near the end 
of a run. 

Almost everything was made inside the workshop 
of the GK Batchelor Laboratory. We pioneered a 
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◄ Figure 1 The basic experimental 
setup. The flow is from left to right - 
under the sluice, over the weir, into 
the ocean. This makes it more uniform 
across the channel - essential if we 
want good sidewall contact. 


Bottom View 
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▲ Figure2 Shown here is the ratio of the frac- 
tional change in the front position to the fractional 
change in time, over a short period. Note that the 
tank is 15cm wide, so power-law behaviour starts at 
approximately three times this. 

sea level control system, to stop large rises in sea 
level as Xanthan enters the ocean. Note the mini- 
mal drop in Xanthan from the weir into the ocean 
- this is because the sea level is < 1 mm below the 
top of the weir. Without removing saltwater from 
the ocean, wed be forced to accept a 2 cm drop, 
potentially alfecting the shelf over a large region. 
Also, conditions would alter signihcantly during 
the experiment (as the 2 cm gap goes down to 0). 
Fortunately, our control system was able to hold 
the sea level constant to within 1 mm. 

Although there were other problems (like 3 specks 
of rust on the weir ruining 4 runs, until we real- 
ised); we mention the sea level in particular as 
we designed the system ourselves and because it 
wasnt controlled at all in a previous study, leading 
to it being fundamentally flawed. 

The key parameters (front position and peak 
thickness, near the source); are clearly visible in 
Figure 1. We took a photo like this every second. 
We speeded up the data analysis by using Matlab 
to trace the outlines (its 2012, this sort of thing is 
easy). What we were looking for were clues - in 
particular, if the front was going as a power law in 
time. But there was another major problem, and 
our apparatus wasn t to blame. 

The Xanthan was at a 1% concentration - this was 
too viscous. It didnt really have enough time to 
spread laterally, so the thickness at the walls was 
low, even in the best experiments. This meant that 
sidewall friction might not be dominating the sys- 
tem (compared to water pressure). 
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▲ Figure3 Front position plotted against time on 
logarithmic axes, with rescaling according to chang- 
es in flux and other parameters. The red band is our 
theory, allowing for errors. The blue and green data 
points are from experiments set up slightly differ- 
ently, while the black ones are considered unreliable. 

Although we got a tentative indication of power 
law-type behaviour, we werent happy and re- 
duced the concentration to 0.5% after a quick 
test. Then we had a real Eureka moment, though I 
didn t fully appreciate what it meant (unlike some 
of the people on the team). As can be seen below, 
the behaviour did indeed appear to converge to a 
power law. The uptick at the end is due to seawa- 
ter extraction sucking the shelf with it (a little). As 
the front slows down, the flat region on the graph 
below actually lasts for 100 seconds (in a 250s ex- 
periment). 

This was the dehning moment of our project. We 
rapidly found more experiments indicating simi- 
lar behaviour. Then, the quest was on to explain 
it - everyone was more conhdent than me that we 
would now be able to succeed for sure. The good 
thing was that the value of 0.6 above is only a bit 
different to what would have happened if xanthan 
was a Newtonian fluid (value = 0.67). However, I 
didnt share the enthusiasm that the shear-thin- 
ning nature of xanthan was having only a minor 
effect - maybe on the numbers, but I thought it 
made things fundamentally different. 

We thought about it very carefully and eventually 
realised that, amazingly, the behaviour is indeed 
similar to the Newtonian case (where the flow 
is essentially a Poiseuille flow, with gradients in 
thickness driving it). Once the shelf is very long so 
sidewall friction dominates, then for our case, we 
get what we termed a generalised Poiseuille flow. 
The velocity is still polynomial in lateral position, 
but (for a shear-thinning fluid) theres a sharper 
edge (i.e. the power is more than 2). 


The 21 st prime number. Its reverse, 37, is the 12th prime 
number. Also, 21 is 10101 in binary while 73 is 1001001. 
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▲ Figure4 Speed as a function of position across 
the channel. The theoretical curve is the smooth 
blue one. Raw data is in red, but errors are present 
so any value within the green curves is consistent 
with the data. Note that the tank is 15cm wide. 



▲ Figure5 Side view of our shelt.Theory predicts 
the apparent discrepancy in the middle, where the 
gradient exceeds that near the source (right) - this 
should cause a variation in velocity along the tank. 

▲ Figure6 Here we zoom into the region near 
the front (left) in Figure 5, which should behave 
as if there are no sidewalls. (Thus, the thickness 
should not vary.) A black horizontal line is drawn on. 
Although this is a small length of shelf, the discrep- 
ancy with the sloped red line is obvious. Note that 
there are seeds in the Xanthan (for PIV). 


We didn t see Figure 3 straight away, of course. It 
was quite something to see it slowly emerge - we 
didn t have the theory fully worked out until half 
the experiments were done. What was especially 
crucial was that we had the gradient measured 
before there was a theory, and the gradient told 
us how the frictional force scales with veloc- 
ity. Knowing there was viscous drag in a narrow 
boundary layer near the walls, this measurement 
meant we realised within days what was going on, 
allowing the theory to be developed. This would 
have taken much longer without the data. 

We also tracked particles that we put into the 
Xanthan. The results are shown below. The lateral 
velocity prohle agreed very well with our model. 
The (peak) velocity where the PIV (particle track- 
ing) was done is 0.246 ±0.010 cm/s. 

The speed of the front at the same time was 0.27 
± 0.01 cm/s. We expected the latter to exceed the 
former by 11%. The key thing is that such varia- 
tions lead to forces other than from sidewalls, but 
such forces are not important for long enough 
shelves (we hoped). Proving that such velocity 
variations are small meant such forces were small 
and so our model was fundamentally correct (i.e. 
it got the dominant force balance). We also com- 
pared the thickness prohle along the channel with 
our prediction (that its nearly a perfect triangle). 
This also showed very good agreement. 

We then turned our attention to better under- 
standing ice tongues (i.e. without sidewalls). The 
above velocity prohle would become uniform 
across the channel. 


We expected the grounding line to stabilise, be- 
cause like any floating object there should be 
some equilibrium. In our model, theres no ten- 
dency to thicken once some sort of equilibrium 
is attained. Based on the extended flatline at 1 on 
Figure 2 and our theory, we expected the front to 
go linearly with time for the whole experiment 
(no drag to slow it down)! The acceleration inher- 
ent in our theory (described previously) was pre- 
dicted to be tiny in shelves of this length. Using 
the idea of forces along the channel balancing at 
the grounding line and using our understanding 
of what the forces are in the shelf (plus the vis- 
cous gravity current theory to determine this in 
the sheet), we created a computer model to pre- 
dict the grounding line thickness. 

We had previously done experiments without 
sidewalls. These were the most accurate ones we 
did, because we intended a 9 cm wide shelf in a 
15 cm wide tank not to hit the sidewalls in a tank 
90 cm long. The tank was levelled to within 2 arc- 
minutes, so we succeeded. We quickly realised 
that the front speed was constant! But, how thick 
was the shelf? Of course, at constant Q and con- 
stant front speed (and constant width, see photo- 
graph) the thickness of the whole shelf had to be 
constant. Photographs (not shown) revealed no 
surprise. But, we hadnt designed the experiments 
to measure the grounding line thickness, which 
was quite low (1 cm at most). 

In the end, we knew Q was constant and so was the 
front speed, and the width and height appeared 
unchanging in time and space. This allowed us to 
use flux conservation plus front speed and width 
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There are 74 different non-Hamiltonian 
polyhedra with a minimum number of vertices. 
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4 Figure7 Thesituation 
when there is a grounding 
line. Note that the tapering 
of the shelf occurs (in a Im 
tank) only if there is contact 
with the sidewalls. 


measurements to infer the thickness of the shelf 
(we always knew Q extremely precisely). To check, 
we counted pixels in the side view, but the very 
low thickness and shadows etc. meant this was 
inaccurate. However, it revealed consistency with 
the indirect measurements outlined above. These 
agreed very closely with our computer model! 

Next, we looked at an interesting effect (see 
comparison of real and laboratory models of ice 
tongues). This is due to a variable width at the 
grounding line, likely caused by a variable entry 
flux. After the grounding line, the shelf moves as 
a solid body. For a real glacier, Q oscillates annu- 
ally but for us it oscillated due to the action of our 
pump (which is peristaltic). We still dont know 
precisely how Q affects the width, but the tapering 
at the front (when Q was rising from 0 to its hnal 
value, as the experiment had only just started); 
indicates that greater fluxes lead to a wider shelf. 

The effort to understand more is still underway, 
but our involvement in it is likely over with the 


completion of this project. The end result is (this 
time) fewer questions than we started with, be- 
cause some have dehnitely been answered, in- 
cluding the most crucial one - whats the domi- 
nant force balance (mathematically)? 

If anyone wants to know what it took to get this 
far, basically its determination and hard work. If 
we could see how to do something but it would 
be very hard, then we would always remind our- 
selves that were lucky - sometimes, you can only 
wish you know what to do. Also crucial to our 
success was not feeling tied to anything that any- 
body (however experienced) predicted about the 
situation (i.e. believing it must be that way before 
the data came in). Instead of blindly trusting any- 
body s ideas, we believed in going with the flow’, 
being guided by data and intuition and above all 
else having an unquenchable conhdence in our 
ability to make at least partial progress on the road 
to understanding glaciers, even if nothing made 
sense and experiments didnt work (because who 
really knows what tomorrow will bring)? 


Number of unitorm polyhedra (excluding the 
intinite set). Pentagonal pyramidal number. 


















Finding Order 
in Randomness 

Maithra Raghu,Trinity 


G iven a complete graph on six yertices, 
denoted K 6 (a graph where every vertex 
is connected to every other vertex); we 
colour each edge of the graph either red or blue. 
Can we find a complete graph on three vertices 
(aka a triangle) such that all its edges are the same 
colour? What about for a bi-coloured K 10 ; can we 
find a monochromatic fC 4 ? 

In both cases, it is indeed possible. These prob- 
lems are an example of the finite case of a theorem 
in Ramsey Theory. 

Ramsey Theory is named after the British math- 
ematician Frank Plumpton Ramsey (1903 - 
1930) whose paper, On a Problem ofFormal Logic 
(1928), proved what is now known as Ramseys 
Theorem. This was not the first theorem proved in 
the area of modern Ramsey Theory; Issai Schur 
proved in 1916 that there always exist monochro- 
matic x, y,z in a finite colouring of the naturals 
such that x + y = z and Van der Waerden his 
eponymous theorem in 1927. However, Ramseys 
work was imperative in ensuring that all these re- 
sults were viewed collectively under one area and 
encouraging further research in this field. 

We have mentioned some problems for colouring 
a finite set of points; but what kind of patterns 
emerge if we colour an inhnite set of points? 

WNFC (When Naturals 
are Finitely Coloured) 

We start simply, assuming as before, that we only 
have two colours, blue and red, at our disposal. 


We now consider colouring edges of the complete 
graph with N as its vertex set with these two col- 
ours. This is a daunting thing to imagine, so we 
shall introduce some (abuse of) notation to make 
things easier. 

Let N (2) denote distinct pairs of natural num- 
bers such that order does not matter i.e (a, b ) = 
(b, a). Then our colouring is simply the function 
c : N (2) {blue, red}. 



We can now picture our daunting infinite com- 
plete graph as below, a sequence with pairs of the 
sequence connected by either red or blue lines. So 
a monochromatic subgraph in this context is sim- 
ply a subset of N on which c is constant. 

We now claim that we can find a subset M of N, 
with 

1. Minfinite 

2. M (2) monochromatic 

To prove this, we pick any natural number a x . All 
the lines coming out of a x are either red or blue. 
So there is some infinite subset B x of N - {a x } 
such that all the lines from a x to B x are of the same 
colour (pigeonhole two and K 0 ). We then pick a 2 
in B x and pick infinite B 2 such that all lines from 
a 2 to B 2 are monochromatic. We keep repeating 
to get a sequence a x ,a 2 ,a 3i ... with each a { having 
either red or blue associated with it. As there are 
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Smallest integer without unique or interesting 
properties worth mentioning here... 







only two colours, some colour occurs intinitely 
often and the terms associated with this colour 
give us our required set. 

An interesting diversion 

We can now prove that any sequence in a totally 
ordered set has a monotone subsequence. Let 
C(a iy dj) denote the colour of the edge (a f , aj). 
Then 

1. C(a iy aj) = red if a { > ay 

2. C(a f , a ; ) = blue if a { < a ; . 

But now we know we have a monochromatic set, 
which corresponds to a monotone sequence. 

This gives another way to prove the Bolzano- 
Weierstrass theorem! We let our totally ordered 
set be the reals, and we have proven that every 
bounded sequence has a monotone subsequence. 
But by the Fundamental Theorem for reals, any 
monotone sequence converges, so we have our 
convergent subsequence. 

And backto where we left... 

Two natural extensions follow: 

1. When two-colouring N (r) for some hnite r, 
can we fmd a subset M as before? 

2. What happens if we use some hnite 
number k of colours instead? 

It is indeed possible to hnd a set M as before in 
both cases. 

For part 1, we proceed by induction. The case r = 1 
is an application of the Pigeonhole principle, and 
the case r = 2 is what we have just proven. So let us 
assume the result for r = k and consider r=k + 1. 
As before, we pick some a x in N. Now notice that 
this induces a two colouring on (N - {a^})^. 
The diagram shows the case for r = 3: 



And by our induction hypothesis, there is an in- 
hnite monochromatic set M for this colouring, 
say colour red. But as the colouring was induced 
on our (r - l)-tuples in M by removing, when we 
add {aj back; we recover our r-tuple colouring, 
which has the same colouring as the (r - l)-tuple. 



So M u {aj provides the set we were looking for. 

And now for part 2, increasing the number of col- 
ours we can create disorder with. The case for col- 
ouring N (2) with some hnite number k of colours 
turns out to be surprisingly easy, using the ideas 
of induction and “colour-blindness”. 

We induct on the number of colours k. We as- 
sume that we only have two colours, red and eve- 
rything-but-red. Then by our previous work, we 
know there is a monochromatic set M. If M is red, 
then we are done! If not, then we’ve just reduced 
to k - 1 colours, which is soluble by the induction 
hypothesis. 

Intinite to Finite 

Before plunging back into the inhnite, note that 
we can prove hnite Ramsey from what we know 
of inhnite Ramsey. We write [n\ = {1, 2, 3, ..., n}. 

Theorem Let m, r e N. Then there exists n 
such that whenever [n] r is two coloured, there 
is a monochromatic set A c [n\ of size m. 

The proof is left as an exercise. Try and construct 
a two colouring of N (r) without a monochromatic 
M, providing the contradiction. 

The canonical RamseyTheorem 

We have managed to avoid the most daunting 
question yet. What happens if we dare to colour 
N 2 with inhnitely many colours? The question 
seems inane; what kind of pattern could we hope 


It is possible for a sudoku puzzle to have up to 
77 givens (out of 81), yet lack a unigue solution. 
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▲ Possible Structures of M 


to find? A monochromatic set M is out of the 
question; we have the power to pathologically col- 
our every point in N 2 a different colour. Yet even 
in this melange of colourings there is some order. 

Theorem If we have an arbitrary colouring 
of N (2) , there exists an infinite M of one of the 
following forms, for arbitrary i <j <k<l: 

(i) M (2) is monochromatic; 

(ii) Each point in M (2) has 
a different colour; 

(iii) (/,;), (k, /) in M (2) have the 
same colour iff i = k; 

(iv) (/,;), (k, l) in M (2) have the 
same colour iff j = l; 

These four possibilities are shown in the diagrams 
above. 

Here we give a sketch proof of this theorem. In 
the proof, we are going to work with sets of the 
form A (4) and colour with the colours of ‘same 
and £ different’ (based on certain properties of our 
set), and then use our previous work to find sets 
monochromatic in ‘same’ or £ different’ to filter out 
our desired properties. 

We first deal with the monochromatic case. 

First we two-colour N 4 by giving (/,j, k, /) colour 

• ‘same if C(/,j) = C(k , /) in original colouring; 

• £ different’ otherwise. 

By our previous work, there exists an infinite 
monochromatic set A v If A x is the colour ‘same 
then it is monochromatic (exercise). So we as- 
sume Aj is ‘different’. 

Note that A x being £ different’ implies that C(/,j) 
never equals C(j,k) in A v else we obtain a contra- 
diction (exercise). 

We define £ left same’ to be if C(i,j) = C(i,k) and 
£ left different’ otherwise, and £ right same’ as C(j,k) 
= C(i, k) and £ right different’ analogously. Now we 
two colour A x twice, to obtain A 2 and A 3 as fol- 
lows. We first find an A 2 monochromatic in £ left 
same’ / £ left different’ and from that form a mono- 


chromatic A 3 in ‘right same’ / ‘right different’ We 
now have various different cases: 

• If A 3 is £ left different’ and ‘right same’ then it 
is case (iv) from above. 

• If A 4 is £ left same’ and ‘right different’ then it 
is case (iii) from above. 

• Note that A 3 cannot be right and left same, 
as it is a contradiction to A x being £ different’ 

• Finally, we have the case where A 3 is dif- 
ferent in both sides. Note that we can find 
a subset A 5 such that A 5 is monochromati- 
cally different both regarding C(i, l) - C(j, k) 
as ‘same’ and C(i, k) = C(j, l) as ‘same’ 

In each case, we can find a monochromatic £ dif- 
ferent’ set M, as M being ‘same’ would result in a 
contradiction with A x being £ different’ Indeed, if 
we found a ‘same’ M for C(i,l) = C(j,k), then pick- 
ing i<j<k<l<m<n, we have C(i,n) = C(j,k) 
andC(/, n) = C(l,m), so C(j,k) = C(l,,) in A 1? which 
is a contradiction. 

This A 2 satisfies case (ii) from above, and we are 
done! 

In Condusion 

In this article, we have focussed solely on Ramseys 
Theorem. But modern Ramsey Theory extends far 
beyond this. Van der Waerdens theorem looks at 
finding monochromatic arithmetic progressions. 
Though originally considered a cornerstone of 
Ramsey Theory, Richard Rados astounding ex- 
tension of Schurs Theorem, suitably called Rados 
Theorem, gives an immediate solution to Van der 
Waerden and all its extensions. Hindman’s Theo- 
rem (proved 1973) then extends Rados ideas of 
partition regularity to an inhnite setting. Despite 
these leaps in understanding, we still lack basic 
information in many areas of Ramsey Theory. In- 
deed, even now, it is not entirely clear how Rado 
managed to conceive his ground breaking theo- 
rem. This absence of full understanding, com- 
bined with the relative clarity of the problems 
ensures that Ramsey Theory will be a fruitful and 
fascinating area of research in the future. 


Smallest integer that can be written as the 
* ® sum of four distinct sguares in three ways. 


Atomic number of Gold. A fortunate, Gaussian, 
happy, Higgs, Kynea, lucky, Pillai and regular prime. 
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Mathematics 
in Wartime 

G H Hardy 


First published in issue 3,1940 

T he editor asked me at the beginning of term 
to write an article for Eureka, and I felt 
that I ought to accept the invitation; but all 
the subjects which he suggested seemed to me at 
the time quite impossible. “My views about the 
Tripos” - I have never really been much interested 
in the Tripos since I was an undergraduate, and I 
am less interested in it now than ever before. “My 
reminiscences of Cambridge” - surely I have not 
yet come to that. Or, as he put it, “something more 
topical, something about mathematics and the 
war” - and that seemed to me the most impossible 
subject of all. I seemed to have nothing at all to say 
about the functions of mathematics in war, except 
that they hlled me with intellectual contempt and 
moral disgust. 

I have changed my mind on second thoughts, and 
I select the subject which seemed to me originally 
the worst. Mathematics, even my sort of math- 
ematics, has its “uses” in war-time, and I suppose 
that I ought to have something to say about them; 
and if my opinions are incoherent or controver- 
sial, then perhaps so much the better, since other 
mathematicians may be led to reply. 

I had better say at once that by “mathematics” I 
mean real mathematics, the mathematics of Fer- 
mat and Euler and Gauss and Abel, and not the 
stuff which passes for mathematics in an engineer- 
ing laboratory. I am not thinking only of “pure” 
mathematics (though that is naturally my hrst 
concern); I count Maxwell and Einstein and Ed- 
dington and Dirac among “real” mathematicians. 


I am including the whole body of mathematical 
knowledge which has permanent aesthetic value, 
as for example, the best Greek mathematics has, 
the mathematics which is eternal because the 
best of it may, like the best literature, continue to 
cause intense emotional satisfaction to thousands 
of people after thousands of years. But I am not 
concerned with ballistics or aerodynamics, or any 
of the other mathematics which has been specially 
devised for war. That (whatever one may think of 
its purposes) is repulsively ugly and intolerably 
dull; even Littlewood could not make ballistics 
respectable, and if he could not, who can? 

Let us try then for a moment to dismiss these sin- 
ister by-products of mathematics and to fix our 
attention on the real thing. We have to consider 
whether real mathematics serves any purposes 
of importance in war, and whether any purposes 
which it serves are good or bad. Ought we to be 
glad or sorry, proud or ashamed, in war-time, that 
we are mathematicians? 

It is plain at any rate that the real mathematics 
(apart from the elements) has no direct utility in 
war. No one has yet found any war-like purpose 
to be served by the theory of numbers or relativity 
or quantum mechanics, and it seems very unlikely 
that anybody will do so for many years. And of 
that I am glad, but in saying so I may possibly en- 
courage a misconception. 

It is sometimes suggested that pure mathemati- 
cians glory in the “uselessness” of their subject, 
and make it a boast that it has no “practical” ap- 
plications. I have been accused of taking this view 
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Godfrey Hardy (1877 - 1947) is not only famous ► 
for his achievements in number theory and 
analysis, but for mentoring Srinivasa Ramanujan 
and for writing '4 Mathematician's Apology" 
on the aesthetics of mathematics. 


myself. I once stated in a lecture, which was after- 
wards printed, that “a science is said to be useful 
if its development tends to accentuate the existing 
inequalities in the distribution of wealth, or more 
directly promotes the destruction of human life”; 
and this sentence, written in 1915, was quoted in 
the Observer in 1939. It was, of course, a conscious 
rhetorical fluorish (though one perhaps excusable 
at the time when it was written). 

The imputation is usually based on an incautious 
saying attributed to Gauss, to the effect that, if 
mathematics is the queen of the sciences, then 
the theory of numbers is, because of its supreme 
“uselessness,” the queen of mathematics, which 
has always seemed to me to have been rather 
crudely misinterpreted. If the theory of numbers 
could be employed for any practical and honour- 
able purpose, if it could be turned directly to the 
furtherance of human happiness or the relief of 
human suffering (as for example physiology and 
even chemistry can), then surely neither Gauss 
nor any other mathematician would have been so 
foolish as to decry or regret such applications. But 
if on the other hand the applications of science 
have made, on the whole, at least as much for evil 
as for good - and this is a view which must al- 
ways be taken seriously, and most of all in time of 
war - then both Gauss and lesser mathematicians 
are justihed in rejoicing that there is one science 
at any rate whose very remoteness from ordinary 
human activities should keep it gentle and clean. 

It would be pleasant to think that this was the 
end of the matter, but we cannot get away from 
the mathematics of the workshops so easily. In- 



directly, we are responsible for its existence. The 
gunnery experts and aeroplane designers could 
not do their job without quite a lot of mathemati- 
cal training, and the best mathematical training 
is training in real mathematics. In this indirect 
way even the best mathematicians becomes im- 
portant in war-time, and mathematics are wanted 
for all sorts of purposes. Most of these purposes 
are ignoble and dreary - what could be more 
soul-destroying than the numerical solution of 
ditTerential equations? - but the men chosen for 
them must be mathematicians and not laboratory 
hacks, if only because they are better trained and 
have the better brains. So mathematics is going 
to be really important now, whether we like it or 
regret it; and it is not so obvious as it might seem 
at hrst even that we ought to regret it, since that 
depends upon our general view of the effect of sci- 
ence on war. 

There are two sharply contrasted views about 
modern “scientihc” war. The hrst and the most 
obvious is that the effect of science on war is 
merely to magnify its horror, both by increasing 
the sufferings of the minority who have to hght 
and by extending them to other classes. This is 
the orthodox view, and it is plain that, if this view 
is just, then the only possible defence lies in the 
necessity for retaliation. But there is a very dif- 
ferent view which is also quite tenable. It can be 
maintained that modern warfare is less horrible 
than the warfare of pre-scientihc times, so far at 
any rate as combatants are concerned; that bombs 
are probably more merciful than bayonets; that 
lachrymatory gas and mustard-gas are perhaps 
the most humane weapons yet devised by military 


Number of stable chemical elements. One of three 
numbers to be the sguare of the sum of its digits. 


81 









"The applications ofscience have made, on the whole, at least as 
much forevilas torgood. Mathematicians like Gauss are justihed 
in rejoicing that there is one science whose very remoteness from 
ordinary human activities should keep it gentle and clean." 

GHHardy 


science, and that the “orthodox” view rests solely 
on loose-thinking sentimentalism. This is the 
case presented with so much force by Haldane in 
Callinicus. It may also be urged that the equalisa- 
tion of risks which science was expected to bring 
would be in the long run salutary; that a civilians 
life is not worth more than a soldier s, or a wom- 
ans than a mans; that anything is better than the 
concentration of savagery on one particular class; 
and that, in short, the sooner war comes “all out” 
the better. And if this be the right view, then sci- 
entists in general and mathematicians in particu- 
lar may have a little less cause to be ashamed of 
their profession. 

It is very dihicult to strike a balance between 
these extreme opinions, and I will not try to do 
so. I will end by pulling to myself, as I think every 
mathematician ought to, what is perhaps an easier 
question. Are there any senses in which we can 
say, with any real conhdence, that mathematics 
“does good” in war? I think I can see two (though I 
cannot pretend that I extract a great deal of com- 
fort from them). 

In the hrst place it is very probable that mathe- 
matics will save the lives of a certain number of 
young mathematicians, since their technical skill 
will be applied to “useful” purposes and will keep 


them from the front. “Conservation of ability” 
is one of the ofhcial slogans; “ability” means, in 
practice, mathematical, physical, or chemical abil- 
ity; and if a few mathematicians are “conserved” 
then that is at any rate something gained. It may 
be a bit hard on the classics and historians and 
philosophers, whose chances of death are that lit- 
tle much increased; but nobody is going to worry 
about the “humanities” now. It is better that some 
should be saved, even if they are not necessarily 
the most worthy. 

Secondly, an older man may (if he not too old) 
hnd in mathematics an incomparable anodyne. 
For mathematics is, of all the arts and sciences, 
the most austere and the most remote, and a 
mathematician should be of all men the one who 
can most easily take refuge where, as Bertrand 
Russell says, “one at least of our nobler impulses 
can best escape from the dreary exile of the actual 
world.” But he must not be too old - it is a pity 
that it should be necessary to make this very seri- 
ous reservation. Mathematics is not a contempla- 
tive but a creative subject; no one can draw much 
consolation from it when he has lost the power or 
the desire to create; and that is apt to happen to a 
mathematician rather soon. It is a pity, but in that 
case he does not matter a great deal anyhow, and 
it would be silly to bother about him. 
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Consecutive Integers 

Paul Erdos 


First published in issue 38,1975/76 

S ome time ago, two old problems on consec- 
utive integers were settled. Catalan conjec- 
tured that 8 and 9 are the only consecutive 
powers. First of all observe that four consecutive 
integers cannot all be powers since one of them is 
congruent to 2 modulo 4. 

It is considerably more dithcult to show that three 
consecutive integers can not all be powers; this 
was accomplished about 20 years ago by Cassels 
and Makowski. Finally in 1974 using some deep 
results of Baker, Tijdeman proved that there is 
an n 0 , whose value can be given explicitly, such 
that for n > n Qi n and n + 1 are not both powers. 
This settles Catalans conjecture almost completely. 

It has been conjectured that if x x < x 2 < x 3 < ... is 
a sequence of consecutive powers, such as x x = 1, 
x 2 — 4, ... then x i+l - x { > i c for all i and some 
suitable constant c. At the moment this seems in- 
tractable. 

It was conjectured more than a century ago that 
the product of consecutive integers is never a 
power. Almost 40 years ago, Rigge and I proved 
that the product of consecutive integers is never a 
square, and recently Selfridge and I proved the 
general conjecture. In fact, our result is that for 
every k and l there exists a prime p>k such that if 

p akj i n(»+o 

i =1 

then 

a kl = 1 modp. 


We conjecture that in fact for all k > 2 there is a 
primep > k with a k j = 1, but this is also intractable 
at the moment. 

It often happens in number theory that every new 
result suggests many new questions - which is a 
good thing as it ensures that the supply of Math- 
ematics is inexhaustible! I would now turn to dis- 
cuss a few more problems and results on consecu- 
tive integers and in particular a simple conjecture 
of mine which is more than 25 years old. 

Put 

m = a k (m) b k (m ), 

a k (m) = Ylp ap > 

where the product extends over all the primes 
p>k and p a \ m. Further dehne 

f(n ; /c, /) = min {a k (n + i) : 1 < i < /}; 

F(k , /) = min{/(n; /c, /) : 1 < n < oo }. 

I conjectured that 


In other words, is it true that for every e there is 
a k £ such that for every k > k £ at least one of the 
integers afn + i) for i =1,...,/, is less than k £ . I am 
unable to prove this but will outline the proof of 

F(k , k) < (1 + e)k for k > k 0 (e). (2) 

To prove (2) consider 

A(n,k) = n{=i fl *( n + 0 (3) 
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where the Yl' in (3) indicates that for every p <kwe 
omit one of the integers n + i divisible by a maxi- 
malpower of p. Then the product 11 CL k (n + i) has 
at least k - n(k) factors and by a simple applica- 
tion of the Legendre formula for the factorisation 
of k\ we obtain 


Y['a k (n + i ) | k\ 


(4) 


If (2) did not hold, we have from (4) and Stirlings 
formula 

\k-n(k) 


((1 + e)k) K ~ n(K> < k k+l e~ k 
or k n(k)+1 > e k (l + e) k ~ n(k) . 

Now, by the prime number theorem, 
(l + c/10 )k 


n{k) < 

and so from (5), 


log k 


(5) 




2 k \ 
log k)’ 


which is false if k is large enough. This contradic- 
tion proves (2). 

Assume for the moment that (1) has been proved. 
Then one can immediately ask for the true order 
of magnitude of F(k , k). I expect that it is o(k £ ) 
for every e > 0. On the other hand, I can prove that 

The problem of estimating F(k , k) and the proof 
of (6) is connected with the following question 
on the sieve of Eratosthenes-Prim-Selberg : deter- 



mine or estimate the smallest integer A(k) so that 
one can hnd, for every p with A(k) <p<k, a resi- 
due u p such that for every integer t < k, t satishes 
one of the congruences to u p modulo p. Clearly 
F(k, k) ^ A(k). Using the method of Rankin-Chen 
and myself I proved 

/ c logWloglo^lo^ 

\ l<w) 

which implies 6. I do not give the proofs here. It 
would be interesting and useful to prove A(k) < k £ 
for every e > 0 and suthciently large k. 

Now, I shall say a few words about F(k , 1) for k + 1. 

It follows easily from the Chinese Remainder The- 
orem that for 1 < n(k) we have F(k , /) = oo, since 
for a suitable n , we can make n + i for 1 < i < n(k) 
divisible by an arbitrarily large power of p x . It is 
easy to see that this no longer holds for / = n(k) + 1 
and in fact it is not hard to prove that 

F(k y n(k) + 1) = 0 P ap , 

where p a p < n(k) < p a p +l . As / increases it gets 
much harder to even estimate F(k> /). 

Many more problems can be formulated which 
I leave to the reader and only state one which is 
quite fundamental: Determine or estimate the 
least / = 4 so that F(k , l k ) = 1. 

In other words, the least l k so that among l k con- 
secutive integers there is always one relatively 
prime to the primes less than k. This question is 
of course connected with the problem of estimat- 



Centred triangular, square, 14-gonal and 
8-gonal number. Number of ways to tie a tie. 
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ing the difference of consecutive primes and also 
with the following problem of Jacobsthal: Denote 
by g(m) the least integer so that any set of g(m) 
consecutive integers contains one which is rela- 
tively prime to m. At a Number Theory meeting 
in Oberwolfach (November ‘75), Kanold gave an 
interesting talk on g(m). Vaughan observed that 
the sieve of Rosser gives g(m) < (log m) 2+£ for all 
£ > 0 if m is sufficiently large. The true order of 
magnitude is not known. 

It seems to me that interesting and dhhcult prob- 
lems remain for 1 < n(k) too. Here we have to con- 
sider the dependence on n too. It is not hard to 
show that for every e > 0 there are inhnitely many 
values of n for which 

/(«; k, Z) > (1 - e) 1 /". (8) 

The proof of (8) uses some elementary facts of 
Diophantine approximation and the Chinese Re- 
mainder Theorem. We do not give the details. I 
do not know how much (8) can be improved. By 
a deep theorem of Mahler, using the p-adic Thue- 
Siegel Theorem,/(n; k , /) > n £+l/l . It is quite pos- 
sible that 


«-»• OO Yi 

Interesting problems can also be raised if k tends 
to inhnity with n\ e.g. how large can f(n; k, n(k)) 
become if k = (1 + o(l)) log w? It seems to be dif- 
ficult to write a really short note on the subject 
since new problems occur while one is writing! 


np a,+1 eS*. 

p<k 

Perhaps I am overlooking an obvious explicit con- 
struction for n k but at the moment I do not even 
have good upper or lower bounds for it. When 
is k\ in S k ? . The smallest such k is 8 and I do not 
know if there are infmitely many such k’s. But by 
Wilsons theorem, p\ is never in S p . 

To complete this note, I state three more extremal 
problems in number theory. Put 

n! = II a i, for a x < a 2 < ... < a n . 

Determine max a x . It follows easily from Stirlings 
formula that a x does not exceed j(l- p^). I 
conjectured that for every rj > 0 and sufficiently 
large n , max a x exceeds ~ e (1 - rj). 

Now write 

n! = 17 bj, for 1 < b x <b 2 < ... < b n < n. 
Determine or estimate min k. 

Clearly k exceeds n - n/ log n and by more com- 
plicated methods I can prove 

k-n-n (l+o(l))/log n , 

k> n - n (log n+c)/ (log n ) 2 , 

where c is a positive absolute constant. 

Finally write 

n! = 17 u i, for u x < u 2 < ... < u k- (11) 


It would be of some interest to know how many of 
the integers a k (n + i) must be different. I expect 
that more than cxk are. If this is proved one of 
course must determine the best possible value of c. 

Denote by K(l) the greatest integer below / com- 
posed entirely of primes below k. Trivially 

minma xa k (n + i) = K(l). (10) 

n i 

To prove (10) observe that on the one hand any 
set of / consecutive integers contains a multiple 
of K(l) on the other that if 2/ divides t , then the 
integers t\ + 1, ..., t\ + l clearly satisfy (10), when 
n = 0. More generally, try to characterise the set 
of n which satisfy (10). To simplify matters, let 
k = 1 and denote n k as the smallest positive integer 
with max f a k (n + i) = k , S k as the class of all inte- 
gers n such that this is true. Ifp a ^ is the greatest 
power ofp not exceeding k then 


Determine or estimate min u k , but k is not fixed. 
It is not hard to prove that u k less than 2 n has only 
a finite number of solutions. I only know of two: 

6! = 8 x 9 x 10, 

14! = 16 x 21 x 22 x 24 x 25 x 26 x 27 x 28. 

It would be difficult to determine all the solutions, 
although Vaughan has found some more: 

3 ! = 6 , 

8! = 12 x 14 x 15 x 16, 

15! = 16 x 18 x20 x21 x22 x25 x26 x27 x28, 

and these are all up to 15. Vaughan also tells me 

40! = 42 x44 x45 x48 x49 x50 x51 x52 x 
54 x 55 x 56 x 57 x 58 x 59 x 60 x 62 x 63 x 
64 x65 x66 x68 x69 x72 x74 x80. 
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Can be written as 222 in base 6. Number of 
metals in the periodic table. Happy number. 





The Ultimate Painting 

DropArtists, 1966 


Sum of the squares of the first four primes. 
Sum of the divisors of the first ten integers. 
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A rchimedes was the greatest mathemati- 
cian, possibly the greatest scientist and 
certainly one of the greatest engineers of 
antiquity. Plutarch writes that although his en- 
gineering achievements ‘gave him the renown of 
more than human sagacity, ... heplaced his whole 
affection and ambition in those purer speculations 
where there can be no reference to the vulgar needs 
of life; studies, the superiority of which to all oth- 
ers is unquestioned, and in which the only doubt 
can be, whether the beauty and grandeur of the 
subjects examined, or the precision and cogency of 
the methods and means ofproof, most deserve our 
admiration! 

Archimedes hnal triumph as an engineer was the 
defence of Syracuse (in 212 BC) when C such terror 
seized the Romans, that, if they did but see a lit- 
tle rope or apiece of wood from the wall, instantly 
crying out, that there it was again, Archimedes was 
about to let fly some engine at them, they turned 
their backs and fled\ However, the Romans even- 
tually prevailed and he died in the sack of the 
city. The Romans, who organised the destruction 
of more people of every race, religion and col- 
our than any empire before, were always a little 
ashamed of killing the greatest mind of antiquity 
and invented several hne stories about his death. 

The writings of Archimedes were collected, cop- 
ied and expounded for the next 1500 years but, 
although some of those who studied them cer- 
tainly understood them, they do not seem to 
have progressed much beyond him. One reason 
for this may have been expressed by Cicero, who 
was proud of restoring the tomb of the great man. 


‘Among them [the Greeks] geometry was held in 
highest honour; nothing was more glorious than 
mathematics. But we [the Romans] have limited the 
usefulness ofthis art to measuring and calculating! 
(Or, as EPSRC might put it, ‘shaping capability’.) It 
is perhaps, not surprising that, although the Ro- 
mans produced much bigger and somewhat better 
versions of existing technologies, they produced 
little that was entirely novel. Another reason for 
the lack of progress may have been the channel- 
ling of Greek abstract thought into the of endless 
marshes of Christian theological controversy. 

The fall of the Eastern Roman Empire, ending 
with the sack of Constantinople in 1492, result- 
ed in the loss of an enormous number of Greek 
manuscripts. Which books survived seems to 
have been mainly a matter of luck. Some of Ar- 
chimedes’ works survived as earlier translations 
into Arabic, but most of what survived was in two 
manuscripts which found their way into the pos- 
session of the Norman kings of the Two Sicilys 
and then into the Yatican library. Both have since 
disappeared, but not before they were translated 
into Latin hrst by William of Markab in 1296 and 
then by James of Crewman in 1544. 

The invention of printing meant that Crewmans 
translation could be widely distributed and Archi- 
medes became a hero and a source of inspiration 
to early scientists like Kepler and Galileo. Archi- 
medes showed them that mathematics could be 
used not merely to study the heavens (which had 
always had an ethereal and so mathematical feel) 
but everyday things like boats Aoating in water. 
Newton wrote his Principia in the Greek (that is 
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to say, the Archimedian) style, making a dithcult 
book even more dithcult, but showing the respect 
due from one mage to another. 


The new calculus of Newton and Leibniz meant 
that any fool (measured on the Newton and Ar- 
chimedes scale) could hnd results which up then 
had required the genius of Archimedes and his 
direct inhuence on science came to an end. 


In 1906, a Danish classical scholar named Heiberg 
realised that a prayer book held in Istanbul was 
written on reused vellum and that the original text 
(which had, of course, been carefully scraped off) 
was a collection of Archimedean works. Working 
from photographs, he was able to recover most of 
this text. Several of the works were known in one 
form or another but one now called The Method of 
Mechanical Theorems created a sensation. 


It begins with words that still thrill me many years 
after I hrst read them: ‘Archimedes to Eratosthenes 
greeting. [...] Seeing moreover in you, as I say, an 
earnest student, a man of considerable eminence 
in philosophy, and an admirer [of mathematical 
enquiry], I thought fit to write out for you [...] the 
peculiarity ofa certain method, by which it will be 
possible [...] to imestigate some of the problems in 
mathematics by means of mechanics. This proce- 


AK( :ii I Mki5I S 
PALCMPSEST 



In 1998 it reappeared, further damaged, in part, by neglect and, in part, 
by a criminal attempt at forgery, and was sold at auction to an anony- 
mous American for a mere two million dollars. Fortunately it was now 
in the charge of someone who knew its true value. The most modern 
scientihc techniques have been used to study it and the results are now 
issued in two beautiful volumes by CUP. (More technical and scholarly 
volumes will follow.) The hrst volume gives the background to the stud- 
ies and the second images of the restoration itself. 


From the point of view of the mathematician, little more is revealed 
than was known through the work of Heiberg. However classicists were 
thrilled by the discovery of several speeches of Hypereides (one of the 
major Greek orators), a commentary on Aristotle and several as yet uni- 
dentihed fragments. 


The Archimedes Palimpsest 
ISBN-13: 9781107014572 
$140.00 




dure is, I am persuaded, no less useful evenfor the 
proofofthe theorems themselves; for certain things 
first became clear to me by a mechanical method, 
although they had to be demonstrated by geometry 
ajterwards because their imestigation by the said 
method did notfurnish an actual demonstration! 


In other words, the great magician will draw back 
the curtain and reveal his secrets. And those se- 
crets turn out to be tremendous - not quite the 
modern calculus of Newton and Leibniz but cer- 
tainly containing many of the ideas, painfully 
discovered by their predecessors, which underlie 
that calculus. Alternative history is a mugs game, 
but it is hard not to feel that, if Galileo or Kepler 
had held The Method in their hands, Western 
science would have been advanced by hfty years. 
Archimedes concluded his introduction with the 
words. T am persuaded that [this method] will be 
ofno little service to mathematics; for I apprehend 
that some, either of my contemporaries or of my 
successors, will, by means ofthe method when once 
established, be able to discover other theorems in 
addition, which have not yet occurred to me! But 
it was not to be. 


Naturally scholars returned to Istanbul to look at 
the original prayerbook, only to hnd it had disap- 
peared! 




It is unlikely that many people will fork out 150 pounds to buy these two 
volumes, particularly since one of them is in ancient Greek. But those 
who do will own a triumph of the art of making books, a triumph of 
the ability of modern science to make darkness visible, a triumph in the 
classicist s six hundred year struggle to to restore the wisdom of the an- 
cients and a monument to a man who more than two thousand years ago 
helped lay the foundations of the modern world. ‘History is indeed little 
more than the register of the crimes, follies and misfortunes of mankind’ 
but occasionally we get a glimpse of something better. These volumes 
are, as it were, the concentrated essence of civilisation. 






















Book Reviews 



Mathematics: A Very Short Introduction 

Timothy Gowers Oxford University Press, 2002 

ISBN 978-0-19-285361 -5 £7.99 

A thoroughly entertaining little book that lends itself well to casual 
reading, and which justihes its title wonderfully. While mathemat- 
ics students may hnd the concepts rather familiar or basic, Gowers 
lucid style and simple examples make the content accessible to all. 

Encouraging the reader to think abstractly, the book touches on 
topics such as fractional dimension, hyperbolic geometry and 
uses of mathematical models. Its essentially independent chapters 
can be read separately, but at the same time are neatly unihed by 
the underlying philosophical flavour. Some may also hnd a Fields 
Medallists responses to oft-asked questions including “Is it true 
that mathematicians arepast it by the time they are 30?” and “Why 
are there so few women mathematicians?” intriguing, in the last 
chapter. 


This is an insightful bridge between the mathematics taught at 
school and what aspiring students can look forward to, and is rec- 
ommended for anyone with an interest in the subject. Stacey Law 



HARTIN REE 


Just Six Numbers 

Sir Martin Rees Phoenix, 2001 

ISBN: 978-0-75-381022-4 £8.99 

Disregarding how abstract the topic is, a good mathematics book 
should be understood at some level by any reader. When I started 
my undergraduate course I just understood the basics from this 
book, that there are six main numbers that dehne cosmology: the 
number of dimensions we live in, the ratio of the strength of grav- 
ity to that of electromagnetism, c, the ratio of mass lost to energy 
when hydrogen is fused to form helium, fl, describing the amount 
of dark matter, A, the cosmological constant, and Q, related to the 
scale at which the universe looks smooth. 

In time I understood the rest of the book. It is a really good book 
to start with, since Martin Reese has managed to explain the key 
ideas behind cosmology today in 180 pages without any “fuss” 
equations. And it proves that cosmology can be done while having 
a nice cup of tea. Carina Negreanu 
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Unitary pertect number and Pronic number. Latitude of 
the North and South poles. Length of a soccer match. 









Proof of Death 

Chris Pearson KindleeBook 

ASIN: B008U8R20K £0.99 

From a grim scene of hostages in Chechnya to the Great Clock 
in Trinity College, Pearson creates a web of mystery around the 
hctional proof of the Riemann Hypothesis. Weaving together vari- 
ous locations and plots to keep you avidly reading to the end, his 
thriller cleverly incorporates both number theory and its applica- 
tion to cryptography. The plot is skihully designed so that math- 
ematicians and non-mathematicians alike are sucked into Aslans 
world of survival and deceit. 

“A prime number, of course - no divisors except itself and one - al- 
ways yielded the best brew” Packed with emotion and description, 
this book is sure to provide a hctional world of mathematical proof 
that any reader can easily delve into. Eleanor Wale, Reading 



Un Sitm-rtH 


ryshiTin 


Algebraic NumberTheory and 
Fermat f s Last Theorem 

lan Stewart, David Tall AKPeters, 2002 

ISBN 1-56881-119-5 £37.99 

It is dithcult to hnd a mathematics book that is both precise and in- 
formal. This book has both qualities, giving historical background 
information while rigorously developing algebraic number theory. 
It is suitable for undergraduates meeting the subject for the hrst 
time. Dehnitions are motivated and important concepts are illus- 
trated by computational examples. 

The material in the hrst 10 chapters is approximately equivalent 
to the Part II Number Fields course, landmarks being ideals, 
Minkowskis Theorem, and the class-group. The remaining chap- 
ters contain the proof of a special case of Fermat's Last Theorem 
(regular prime exponents), which uses all the previously intro- 
duced ideas. They also touch on more advanced topics leading up 
to a sketch proof of its general version. 

The extra material on elliptic curves and elliptic functions has lit- 
tle to do with the rest of the book and feels a bit disconnected. 
However, the chapters on algebraic number theory are excellent 
for accompanying a university course, while the last part will whet 
the readers appetite for more. Philipp Kleppmann 


Smallest non-trivial cabtaxi number. Smallest 
pseudoprime satistying 3 n = 3 mod n. 
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Archimedeans 

Christmas Catalogue 


f Klein Salad Dressing Bottle 

Keep oil on the inside, 
vinegar on the outside 


£20 



Ouantum Surfboard 

Don't use on unrestricted 
wavefunctions 


£35 



Random Walk Generator 

Comprises stereotypical mathema- 
tician and half-pint of larger. 


FREE 


Calculus-removing 
■w toothpaste 

Guaranteed opague. 


£5 



3D Random Walk Generator 

Also includes centrituge and 
trampoline. 

Accessories: reflecting barriers, absorb- 
ing barriers, extra-absorbing barriers 
for mopping up resulting spills. 


£99 


rlj Anthropomorphiser 

_ Ascribes human qualities and 

| ji I emotions to functions, sets, 
i ® numbers etc. Not for use on 
mathemat icians! 


£i 



Epsilon Magnifier 

Sick of struggling with tiny 
epsilons? The revolutionary new 
epsilon magnitier simplities analy- 
sis by increasing all epsilons to 
values > 1. 


£6 



Set of Pathological Cases 

For the more experienced 
traveller, save money with our 
nowhere-dense set of luggage. 


£40 



The Escher Machine 

The ball rolling down an infinite 
slope generates enough energy 
to power a light bulb. 

NEW! Uphill version: uses two 
1.5VAA batteries per day. The 
ideal gift for someone you dislike. 


£ oo 


All items in our catalogue can be 
ordered by writing to 

The Archimedeans 
O Kolmogorov Street 
XI024 Cantortown 


Irwented by Chris Cummins, Eureka 56 
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Number of "atomic elements" in the Look-and-say 
seguence. Atomic number of Uranium. 










































Compiled byCJ Budd, Eureka 43 

Hereyou see three different dehnitions forsome obscure mathematical terms. 

It is your task to f\nd - with justihcation - the correct one. Solutions are on page 94. 


(1) This point was discovered by that well- 
known mathematician NicolasTarry. Given 
an earth-moon-sun-spaceship system, 
the Tarry point is that point where a body 
would be in equilibrium. 

(2) The Tarry point of a triangle is the point 
on its circumcircle opposite to its Steiner 
point- the point of intersection of the lines 
through vertices of the triangle parallel to 
the corresponding sides of the first Brocard 
Triangle. The vertices of the Brocard triangle 
are on the points of intersection of the 
lines from the vertices of the triangle to the 
Brocardpoints X and Y. These are such that 
^XAV = aXAC and ^YBA = ^YAC. 

(3) Given a dynamical system, theTarry 
point is the point at which the rate of 
growth ceases to be exponential - although 
polynomial growth is still permitted. 


(1) Asystem oforganising movementsin a 
Bridge tournament, formalised by Mr Room. 

(2) A dissection of a square into smaller 
squares of different side lengths ('rooms'), 
discovered by Trinity students in 1939. 

(3) Tile a cuboid in R n regularly by sub- 
cuboids. Mark certain faces, ensuring that 
no subcuboid has >3 faces marked.Then if 
it is possible to go from one subcuboid to 
another entirely by marked faces, we have a 
room design, where the subcuboids are the 
rooms and the marked faces are the doors. 


UnoerWronslotio^^^ 

1 (1) A device discovered byUngerand 
widely used in the engineering industry. It 
transforms a problem in potential theory to 
another which may be easier to solve. 

(2) UngersTranslation can transform a 
series of simultaneous nonlinear partial 
differential equations to non-Euclidean 
geometry, where it looks prettier even if it 
still may be insoluble. 

(3) Ungers'translation is, of course, a trans- 
lation by Unger of a work by Hilbert. 


fl AAouse 


(1) On each face of a tetrahedron construct 
another tetrahedron of side 1/3 of the origi- 
nal. Continue this process for ever. What 
you end up with is a mouse: a finitely small 
yet infinitely furry little animal. 

(2) A mouse is, naturally, a subset of a Cat, 
a connected absorbing topology! A mouse 
is any subset of a cat which has a tail (i.e. 

a proper one dimensional subset. This tail 
must of course be unique and no two mice 
are permitted to have the same tail. 

(3) A premouse is an admissable set with 
an ultrafilter which thinks the ordering it 
gets from the ultrafilter is a well-ordering. 

If the ordering is close enough to let us 
iterate on the ultrafilter we have an iterable 
premouse. If this is well behaved we have 

a critical iterable premouse. A mouse is a 
critical iterable premouse for which every 
sub-premouse is also critical. 


Number of sets with 8 integers so that each is a 
proper divisor of the product of the others plus 1. 
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Solutions 


Archimedeans Problems Drive 

1 Dazzling Dice 

Most likely is 6, with probability 16807/46656. 

2 SnappySurds 

20, 28 and 100. 

3 Painful Primes 

999 917 is prime. 

4 Compelling Convergence 

(a) diverges; 

(b) converges to n 2 /6; 

(c) converges to tanh _1 (log 2) - log 2. 

5 SuperbSets 

(a) is countably inbnite; 

(b) is bnite of size 1; 

(c) is uncountable as it bijects with M; 

(d) is uncountable as it bijects with ®L 

6 TriumphantTreasures 

Tbe treasure is buried on the moon. 

7 CuriousCoins 

You want to go brst for all n. 

8 Perceptive Polygons 

Yes! 

9 TerribleTriangles 

8 2^/3 \ 

19 5 l9~/' 


10 Rough Relations 

f 0 n <3 

f(n) = l 2 n = 4 

+ 1) n > 5 

11 Gorgeous Geometry 

Line perpendicular to OD. 

12 Mysterious Matchings 

2 

13 Dazzling Digits 

4000 


14 Cryptic Crossword 

Tbe treasure is buried on the moon. 



Call My Bluff 

Tbe Tarry Point: Debnition 2 Unger s Translation: Debnition 3 

A Room Design: Debnition 1 A Mouse: Debnition 3 
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Number of graphic non-whitespace characters 
in ASCII. Smith number. 94! - 1 is prime. 
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